Abstract —This paper focuses on linear beamforming design
and power allocation strategy for ergodic rate optimization in
a two-user Multiple-Input-Single-Output (MISO) system with
statistical and delayed channel state information at the transmitter
(CSIT). We propose a transmission strategy, denoted
as Statistical Alternative MAT (SAMAT), which exploits both
channel statistics and delayed CSIT. Firstly, with statistical
CSIT only, we focus on statistical beamforming (SBF) design
that maximizes a lower bound on the ergodic sum-rate. Secondly,
relying on both statistical and delayed CSIT, an iterative
algorithm is proposed to compute the precoding vectors of
Alternative MAT (AMAT), originally proposed by Yang et al.,
which maximizes an approximation of the ergodic sum-rate with
equal power allocation. Finally, via proper power allocation,
the SAMAT framework is proposed to softly bridge between
SBF and AMAT for an arbitrary number of transmit antennas
and signal-to-noise ratio (SNR). A necessary condition for the
power allocation optimization is identified from the Karush-Kuhn-Tucker
(KKT) conditions. The optimum power allocation
to maximize an ergodic sum-rate approximation is computed
using Sequential Quadratic Programming (SQP). Simulation
results show that the proposed SAMAT scheme yields a significant
sum-rate enhancement over both SBF and AMAT.

Index Terms —MISO systems, statistical CSIT, delayed CSIT, 
power allocation, statistical beamforming, SQP. 


I. INTRODUCTION 

In the multi-user (MU) MISO Broadcast Channel (BC),
schemes that achieve the sum-rate capacity and the capacity
region have been studied. The performance of these
optimized approaches heavily depends on the acquisition of
accurate and instantaneous Channel State Information (CSI) at
the transmitter, which is not feasible in practice due to channel
estimation errors, limited feedback resources and delay.
Moreover, obtaining perfect CSIT can incur unaffordable
feedback overhead.

In practice, only statistical CSI and/or perfect but outdated CSI
is available at the transmitter side. The former, which is
characterized by the channel covariance matrix, varies slowly
and can therefore be easily and accurately acquired
through long-term feedback. One simple Statistical Beamforming
(SBF) approach is to precode the transmitted symbol along
the weakest eigenvector of the channel covariance matrix of
the unintended user. Such a scheme generalizes the idea of
the Perfect CSIT-aided Zero-Forcing Beamforming (PZFBF)
to the statistical CSIT only environment. Thus, we denote the
SBF strategy with Weakest Eigenvector as SWEBF. Recently, the
SBF method with Generalized Eigenvector (SGEBF) has been
shown to maximize the ergodic sum-rate at high SNR for the
$M$-user $M$-transmit-antenna MISO BC when $M = 2$ or
$M \to \infty$. However, the optimal precoder for the general
$M > 2$ case is still unknown due to the lack of a closed-form
ergodic sum-rate expression. In that work, the generalized
eigenvector (GE) solution is derived from the ergodic
signal-to-leakage-and-noise ratio (SLNR), which leverages the
independence between the numerator and denominator of the SLNR.
However, the optimal solution to the true problem (ergodic
sum-rate) is not apparent. In this paper, we focus on a lower
bound on the ergodic sum-rate and address the problem for the
two-user, arbitrary $M$-transmit-antenna case.

Whenever the feedback delay is larger than the channel's
coherence time, the CSIT is delayed but was proved to still
benefit the Degree-of-Freedom (DoF) of the MISO BC. In
the example of the two-transmit-antenna, two-user channel, the
maximum sum DoF of $\frac{4}{3}$ can be obtained by retransmitting
the overheard interference and performing interference cancellation.
This strategy is referred to as MAT. Subsequent work generalized
this finding as GMAT and achieved a higher data
rate at finite SNR by constructing precoders which strike a
balance between desired signal enhancement and interference
alignment. Moreover, an alternative MAT (AMAT) transmission
strategy was introduced in [12] to achieve a sum DoF
in $[\frac{4}{3}, 2]$ by utilizing both imperfect current CSIT and
perfect delayed CSIT. In time-correlated channels, the solution
proposed in [12] smoothly bridges between MAT and PZFBF
in terms of sum DoF. Inspired by [12], we aim to bridge
between AMAT and SBF in terms of ergodic sum-rate based
on statistical and delayed CSIT. Although our framework is
similar to [12], the channel models are completely different.
In our scenario, a spatially correlated channel model is assumed.
Statistical CSIT (full-rank channel covariance matrix) is useless
for DoF enhancement or equivalent transmission slot
reduction, since the channel estimation error based only on
this statistical information does not scale down with SNR.
Nevertheless, statistical CSIT is useful for boosting the
sum-rate at finite SNR.

With both channel statistics and delayed CSIT at hand,
an enhanced MAT strategy, denoted as VMAT, has been developed,
yielding a higher sum-rate than the original MAT
at finite SNR. However, in highly correlated channels, the
rate performance of VMAT is still inferior to SBF¹, which
exploits only statistical CSIT. In a nutshell, statistical
channel information is not fully exploited. Related work
analyzes the error rate performance at finite SNR and the
diversity-multiplexing trade-off at infinite SNR of a space-time
encoded transmission with delayed and statistical CSIT.
So far, there has been no investigation on how to further
enhance the finite SNR sum-rate performance beyond that
achievable with either statistical CSIT strategies (e.g., SBF)
or delayed CSIT strategies (e.g., AMAT). In this paper, we
design a spatial precoding AMAT-style transmission scheme,
denoted as statistical AMAT (SAMAT), that softly bridges
the gap between AMAT and SBF at any SNR. With this
background, the main focus of this paper is to investigate
power allocation and beamforming optimization. Specifically,
the main contributions are listed as follows:

¹Throughout the paper, SBF refers to either SWEBF or SGEBF.


• With either statistical or delayed CSIT, the proposed
SAMAT strategy can directly boil down to SBF or
AMAT. In the former case, we show that SGEBF is
optimal to maximize a lower bound on the ergodic sum-rate
at high SNR. Under both CSIT but equal power
allocation, statistical CSIT-aided AMAT can achieve a
significant boost of rate performance relative to the
original AMAT. More specifically, an efficient iterative
algorithm is developed to compute the optimal statistical
precoders to maximize an approximation of the ergodic
sum-rate. Monotonic convergence of the algorithm is
proved. For the two-user two-transmit-antenna case, any two
beamforming vectors constituting a unitary matrix are
shown to be optimal.

• The power allocation can be further optimized to maximize
the ergodic sum-rate. A SAMAT transmission
strategy is proposed to bridge the gap between SBF
and AMAT at any SNR. Due to the complexity of
deriving a closed-form expression for the ergodic sum-rate,
a tractable approximation needs to be computed. A
Sequential Quadratic Programming (SQP) algorithm
is implemented to solve the consequent non-linear
non-convex constrained optimization problem. The necessary
condition on power allocation optimization is further
identified. In low spatial correlation channels, the proposed
SAMAT scheme boils down to AMAT. In highly
correlated channels, SAMAT behaves as SBF at low SNR
while performing as AMAT at high SNR. In general,
SAMAT enables a significantly higher sum-rate than
both SBF and AMAT. This achievement stems from two
aspects: 1) the transmission of extra private messages; 2)
the optimized power allocation for SAMAT.


The rest of this paper is organized as follows. Section II
introduces the system model. Sections III and IV elaborate the
optimal precoder design for SBF and statistical CSIT-aided
AMAT, respectively. In Section V, we formulate SAMAT and
maximize the achievable ergodic sum-rate by optimizing the
power allocation. Numerical results are shown in Section VI,
while Section VII concludes the paper.

Notations: Bold lower case and upper case letters denote
vectors and matrices, respectively. The superscripts $(\cdot)^T$ and
$(\cdot)^H$ represent the transpose and conjugate transpose. The
notation $\mathrm{diag}(\cdot)$ stands for a diagonal matrix whereas $E\{\cdot\}$
is the expectation operator. $\lambda_{\max}(\cdot)$ and $\lambda_{\min}(\cdot)$ indicate the
largest and smallest eigenvalues of a matrix and their corresponding
eigenvectors are denoted by $\mathbf{u}_{\max}(\cdot)$ and $\mathbf{u}_{\min}(\cdot)$,
respectively. $N(\cdot)$ stands for the null space of a matrix.
Operators $\mathrm{Tr}(\cdot)$ and $\det(\cdot)$ refer to the trace and determinant
of a matrix. We denote $\mathrm{Exp}(c)$ as the exponential distribution
with parameter $c$ and $U(a, b)$ as the uniform distribution. Let
$\mathcal{S}_{PD} = \{\mathbf{R} \mid \mathbf{R} \text{ is positive definite}\}$.


II. SYSTEM MODEL 

Consider a MISO broadcast channel where the transmitter,
equipped with $M$ antennas ($M \geq 2$), wishes to send private
messages to two users, each with a single antenna. Perfect CSI
is instantaneously available at the receiver side whereas the 
transmitter acquires this information with a delay larger than 
the coherence time of the channel (such that the reported CSI 
known at the transmitter is uncorrelated with the current CSI). 
We assume perfect statistical CSIT which is characterized by 
the spatial correlation matrix. This is a reasonable assumption 
because channel statistics are more related to the scattering 
environment and independent of the transmission period. 

A Rayleigh fading channel model is considered, which implies
that the spatial statistics can be completely described by the
second-order moments of the channel. Specifically, we
denote the channel between the transmitter and user A in time
slot $j$ as $\mathbf{h}_j$ and similarly $\mathbf{g}_j$ for user B:

$$\mathbf{h}_j = \mathbf{R}_A^{1/2}\mathbf{h}_{w,j}, \qquad \mathbf{g}_j = \mathbf{R}_B^{1/2}\mathbf{g}_{w,j}, \qquad (1)$$

where $\mathbf{h}_{w,j}$ and $\mathbf{g}_{w,j}$ are $M \times 1$ vectors with independent and
identically distributed (i.i.d.) $\mathcal{CN}(0, 1)$ entries. They are assumed
constant within one time slot and varying independently
across time slots. $\mathbf{R}_A$ and $\mathbf{R}_B$ are full rank positive definite
covariance matrices² for user A and B respectively, which can
be decomposed as $\mathbf{R}_k = \mathbf{V}_k\mathbf{\Lambda}_k\mathbf{V}_k^H$, $k = A, B$. $\mathbf{V}_k$
is a unitary matrix whose columns are eigenvectors of $\mathbf{R}_k$,
while the diagonal $\mathbf{\Lambda}_k$ that contains the eigenvalues of $\mathbf{R}_k$
is normalized as $\mathrm{Tr}(\mathbf{\Lambda}_k) = M$. $\mathbf{\Lambda}_k = \mathbf{I}$ indicates the $k$-th
channel is spatially uncorrelated while $\mathrm{rank}(\mathbf{\Lambda}_k) = 1$ implies
it is fully correlated.
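The channel model in (1) can be illustrated with a short simulation. The following is a minimal sketch, not the authors' code: the exponential correlation model in `exp_corr`, the helper names and the sample size are all assumptions introduced here for illustration. It draws realizations $\mathbf{h} = \mathbf{R}^{1/2}\mathbf{h}_w$ and forms a sample estimate of the covariance.

```python
import numpy as np

def exp_corr(M, t):
    """Exponential correlation model (an assumption, not taken from the paper):
    R[i, j] = t**(j - i) for j >= i and conj(t)**(i - j) otherwise, so Tr(R) = M."""
    d = np.arange(M)[None, :] - np.arange(M)[:, None]   # d[i, j] = j - i
    t = complex(t)
    return np.where(d >= 0, t ** np.abs(d), np.conj(t) ** np.abs(d))

def sqrtm_psd(R):
    """Hermitian square root R^(1/2) from the eigen-decomposition R = V diag(lam) V^H."""
    lam, V = np.linalg.eigh(R)
    return (V * np.sqrt(np.clip(lam, 0.0, None))) @ V.conj().T

def correlated_channels(R, n, rng):
    """Draw n vectors h = R^(1/2) h_w with i.i.d. CN(0, 1) entries in h_w.
    Returns an (n, M) array whose k-th row is the k-th realization (transposed)."""
    M = R.shape[0]
    hw = (rng.standard_normal((n, M)) + 1j * rng.standard_normal((n, M))) / np.sqrt(2)
    return hw @ sqrtm_psd(R).T

rng = np.random.default_rng(0)
R_A = exp_corr(4, 0.9 * np.exp(1j * 0.3))   # hypothetical covariance of user A
H = correlated_channels(R_A, 50_000, rng)
R_hat = H.T @ H.conj() / H.shape[0]         # sample estimate of E[h h^H]
```

With enough samples `R_hat` should approach `R_A`, which is the sense in which the second-order statistics fully describe the channel under this model.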

The proposed SAMAT framework is shown in Fig. 1.
It contains two stages/three time slots. In the 1st slot, the
transmitter superposes four private symbols $s_{A1}, s_{A2}, s_{B1}, s_{B2}$
and sends them to both users. Denote $\mathbf{u}_A = \mathbf{W}\mathbf{P}_A^{1/2}\mathbf{s}_A$
and $\mathbf{u}_B = \mathbf{Q}\mathbf{P}_B^{1/2}\mathbf{s}_B$ as the encoded symbols with statistical
beamformer and power allocation, where $\mathbf{W} = [\mathbf{w}_1\ \mathbf{w}_2]$, $\mathbf{Q} = [\mathbf{q}_1\ \mathbf{q}_2]$,
$\mathbf{P}_A = \mathrm{diag}(P_1, P_2)$, $\mathbf{P}_B = \mathrm{diag}(P_3, P_4)$. $\mathbf{s}_k = [s_{k1}, s_{k2}]^T$
represents the Gaussian symbols intended to user
$k$ and $E[\mathbf{s}_k\mathbf{s}_k^H] = \mathbf{I}$. At the end of this stage, each user
receives its desired signal as well as the overheard interference
due to the superposed transmission. Denote $\eta_A = \mathbf{h}_1^H\mathbf{u}_B$ and
$\eta_B = \mathbf{g}_1^H\mathbf{u}_A$ as the interference overheard by user A and B,
respectively. In stage II, the transmitter has access to $\mathbf{h}_1$ and
$\mathbf{g}_1$ (delayed CSIT). Then, $\eta_A$ and $\eta_B$ can be reconstructed and
broadcast via a single antenna in the following two slots. This
stage helps both users eliminate the overheard interference and
reinforce the desired signals. In addition, new private messages
$s_{A3}, s_{A4}, s_{B3}, s_{B4}$ are sent to both users in a superposed fashion
and this extra transmission makes use of statistical CSIT only.
$P_k \geq 0$, $k = 1, \ldots, 10$ indicate the power allocated to each
symbol, and $\mathbf{w}_k, \mathbf{q}_k$, $k = 1, \ldots, 3$ denote $M \times 1$ unit-norm
precoders which depend only on statistical CSIT.

²For the rank-deficient case, the symbol intended to user $i$ is simply precoded
by a column vector in $N(\mathbf{R}_j)$. By doing this, the overheard interference of
each symbol can be completely removed. Thus, we can transmit two symbols
at one time instant, achieving a sum DoF of 2 as if we had perfect CSIT.

Fig. 1: Block diagram of the proposed SAMAT scheme.

The proposed SAMAT scheme facilitates a smart use of
statistical and/or delayed CSIT. With statistical CSIT only, we
simply switch off the retransmission of the overheard interference,
which requires delayed CSIT, and SAMAT then
boils down to SBF in each time slot. With delayed CSIT only,
we simply put $P_6 = P_7 = P_9 = P_{10} = 0$. Then, SAMAT
becomes the AMAT scheme and enables a sum DoF of $\frac{4}{3}$ at
high SNR. If the transmitter has both statistical and delayed
CSIT, proper power allocation and statistical precoding can
make room for the transmission of extra symbols. The benefits of
transmitting new symbols outweigh the loss caused by the
interference they create to the main AMAT transmission. In this
case, the proposed SAMAT framework allows for the parallel
transmission of SBF on top of AMAT while outperforming
AMAT and SBF at any SNR. More specifically, the transmitted
signals are written as

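$$\mathbf{x}_1 = \mathbf{u}_A + \mathbf{u}_B$$
$$\mathbf{x}_2 = \sqrt{P_5}\,[\eta_A, \mathbf{0}]^T + \sqrt{P_6}\,\mathbf{w}_3 s_{A3} + \sqrt{P_7}\,\mathbf{q}_3 s_{B3} \qquad (2)$$
$$\mathbf{x}_3 = \sqrt{P_8}\,[\eta_B, \mathbf{0}]^T + \sqrt{P_9}\,\mathbf{w}_3 s_{A4} + \sqrt{P_{10}}\,\mathbf{q}_3 s_{B4}.$$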

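For (A)MAT-based schemes, the transmit power in stage II
inherently depends on the channel realization, which changes
rapidly, so the power consumed in each transmission period
is hardly constant. Thus, a less restrictive metric is the
long-term average power constraint

$$P_c = E[\mathrm{Tr}(\mathbf{x}_1\mathbf{x}_1^H)] + E[\mathrm{Tr}(\mathbf{x}_2\mathbf{x}_2^H)] + E[\mathrm{Tr}(\mathbf{x}_3\mathbf{x}_3^H)]$$
$$\quad = P_1 + P_2 + P_3 + P_4 + P_6 + P_7 + P_9 + P_{10} + P_5(\lambda_{A1}P_3 + \lambda_{A2}P_4) + P_8(\lambda_{B1}P_1 + \lambda_{B2}P_2)$$
$$\quad \leq 3P, \qquad (3)$$

where $\lambda_{A1} = \mathbf{q}_1^H\mathbf{R}_A\mathbf{q}_1$, $\lambda_{A2} = \mathbf{q}_2^H\mathbf{R}_A\mathbf{q}_2$, $\lambda_{B1} = \mathbf{w}_1^H\mathbf{R}_B\mathbf{w}_1$,
$\lambda_{B2} = \mathbf{w}_2^H\mathbf{R}_B\mathbf{w}_2$. The expectation is taken over
the input signals and the channels, and $P$ denotes the average
power budget of the transmitter for each time slot.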
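As a quick numerical reading of the constraint (3), the sketch below evaluates the long-term power $P_c$ for a given power vector and pair of precoding matrices. It is only an illustration: the function name `long_term_power`, the argument layout and the example covariance matrices are assumptions made here. The example uses the equal-power AMAT setting discussed later in Section IV for $M = 2$.

```python
import numpy as np

def long_term_power(P, W, Q, R_A, R_B):
    """Evaluate P_c in (3). P holds (P_1, ..., P_10); W and Q are M x 2 precoders."""
    P1, P2, P3, P4, P5, P6, P7, P8, P9, P10 = P
    # lam_A[i] = q_i^H R_A q_i and lam_B[i] = w_i^H R_B w_i
    lam_A = np.real(np.einsum('mi,mn,ni->i', Q.conj(), R_A, Q))
    lam_B = np.real(np.einsum('mi,mn,ni->i', W.conj(), R_B, W))
    direct = P1 + P2 + P3 + P4 + P6 + P7 + P9 + P10
    relayed = (P5 * (lam_A[0] * P3 + lam_A[1] * P4)
               + P8 * (lam_B[0] * P1 + lam_B[1] * P2))
    return direct + relayed

# Hypothetical two-antenna example with Tr(R_A) = Tr(R_B) = 2.
R_A = np.array([[1.0, 0.8], [0.8, 1.0]])
R_B = np.array([[1.0, -0.5], [-0.5, 1.0]])
W = Q = np.eye(2)
budget = 1.0                      # per-slot budget P
rho = 3 * budget / 8              # equal power per symbol for M = 2
P_vec = (rho, rho, rho, rho, 1.0, 0.0, 0.0, 1.0, 0.0, 0.0)
P_c = long_term_power(P_vec, W, Q, R_A, R_B)
```

In this setting the four private symbols consume $4\rho$ and each retransmitted interference term consumes $2\rho$, so the three-slot budget $3P$ is met with equality.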

Although our framework is similar to [12], there are essential
distinctions between them. A first distinction lies in the
channel model. We exploit spatial correlation to compress the
interference and make room for the transmission of extra symbols,
while they make use of time correlation. The power allocation
in [12] depends on the SNR and the quality of current CSIT, while
our power allocation strategy relies on the SNR, the precoder design
as well as the spatial correlation.

The second distinction lies in the encoding/decoding
strategy (and hence the transmission protocol). More specifically,
interference quantization is crucial for [12], where
the overheard interference symbol with a reduced power is
transmitted with full power in order to save channel resources.
Interference quantization is proposed to solve the
consequent problem of power mismatch (which scales with
transmit power). By decoding the interference symbols first,
[12] equivalently obtains one AMAT transmission plus two ZF
transmissions. A DoF gain at high SNR can be obtained over the
original AMAT scheme.

In contrast with [12], the overheard interference is multicast
by analog transmission in our scenario and the reason is
threefold. First, we retransmit the interference symbols after
scaling them by constant (i.e., not scaling with the transmit
power) factors $P_5$ and $P_8$. $P_1, P_2, P_3, P_4$ in the main AMAT
transmission scale with the transmit power at high SNR to
achieve the DoF of $\frac{4}{3}$. However, to guarantee the power
constraint, the multiplication terms $P_5(\lambda_{A1}P_3 + \lambda_{A2}P_4)$ and
$P_8(\lambda_{B1}P_1 + \lambda_{B2}P_2)$ in eq. (3) limit $P_5$ and $P_8$ to some
constants. Second, interference quantization would prevent the
proposed SAMAT scheme from bridging to SBF at low SNR.
More specifically, in the stage II slots, SAMAT should behave
as SBF at low SNR and should therefore allocate most of
the transmit power to the extra symbols and only a little
power to the overheard interference symbol. Following [12], if
interference quantization is applied, the digitized interference
should be decoded first by treating the extra symbols as noise.
In this case, however, the decoding would fail because the
noise power would overwhelm the desired signal power. Third,
due to the inherent properties of the channel model (full-rank
channel covariance matrix), a sum DoF strictly larger
than $\frac{4}{3}$ cannot be achieved in our case (contrary to [12]).
Hence the SAMAT transmission and reception strategies are
not motivated by a DoF maximization. With SAMAT, a sum
DoF of $\frac{4}{3}$ is achieved where the extra private symbols are
not used to increase the DoF at high SNR (contrary to [12])
but to boost the sum-rate at low/finite SNR. This implies that
the retransmitted overheard interference does not have to be
decoded first in SAMAT (contrary to [12]) but can simply
be aligned and cancelled so as to decode the private and
extra symbols. Recall again that [12] relies on interference
quantization to decode first the overheard interference and
then the private messages in order to increase the sum DoF
beyond $\frac{4}{3}$. The detailed description of our decoding strategy
is provided in Section V.


III. STATISTICAL BEAMFORMING 

Let us first focus on the scenario with statistical CSIT
only. As mentioned before, the proposed SAMAT scheme
boils down to SBF, i.e., the transmitter sends two statistically
precoded symbols, each intended to one user. Since the system
model for the transmission in each slot is identical, we
can focus on one slot and omit the subscript. Assuming the
transmitter equally allocates its power budget to both users,
the delivered signal described by (2) becomes

$$\mathbf{x} = \sqrt{\rho}\,\mathbf{w}s_A + \sqrt{\rho}\,\mathbf{q}s_B, \qquad (4)$$

where $\rho = \frac{P}{2}$. For simplicity, we will look at the rate
performance of user A only; a similar derivation can be
easily extended to user B. The received signal at the receiver
side is given as $y = \sqrt{\rho}\,\mathbf{h}^H\mathbf{w}s_A + \sqrt{\rho}\,\mathbf{h}^H\mathbf{q}s_B + n_A$, where
$\mathbf{h} \in \mathbb{C}^{M \times 1}$ is the channel vector and $n_A \sim \mathcal{CN}(0, 1)$ is the
standard complex additive white Gaussian noise (AWGN). The
achievable ergodic rate of user A is given by

$$R_A = E[\log_2(1 + \mathrm{SINR}_A)], \qquad (5)$$

where $\mathrm{SINR}_A = \frac{\rho|\mathbf{h}^H\mathbf{w}|^2}{1 + \rho|\mathbf{h}^H\mathbf{q}|^2}$ is the instantaneous
signal-to-interference-and-noise ratio (SINR) obtained by treating the
multi-user interference as noise. Consequently, the ergodic sum-rate of
the system with linear beamforming is $R_{sum} = R_A + R_B$.
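The rate in (5) has no simple closed form, but it can be estimated by Monte Carlo averaging for any pair of unit-norm precoders. The sketch below is illustrative only: the covariance model `exp_corr`, the SNR value and the helper names are assumptions made here. It evaluates (5) for the two precoder choices named in the introduction, the weakest eigenvector of the unintended user's covariance (SWEBF) and the dominant generalized eigenvector $\mathbf{u}_{\max}(\mathbf{R}_B^{-1}\mathbf{R}_A)$ (SGEBF).

```python
import numpy as np

def exp_corr(M, t):
    """Real exponential correlation model R[i, j] = t**|i - j| (illustrative assumption)."""
    idx = np.arange(M)
    return t ** np.abs(idx[:, None] - idx[None, :])

def ge_precoder(R_sig, R_int):
    """Unit-norm dominant eigenvector of R_int^{-1} R_sig via Cholesky whitening."""
    L = np.linalg.cholesky(R_int)                      # R_int = L L^H
    X = np.linalg.solve(L, R_sig)                      # L^{-1} R_sig
    C = np.linalg.solve(L, X.conj().T).conj().T        # L^{-1} R_sig L^{-H} (Hermitian)
    _, V = np.linalg.eigh(C)
    w = np.linalg.solve(L.conj().T, V[:, -1])          # map back: w = L^{-H} v
    return w / np.linalg.norm(w)

def weakest_eigvec(R):
    """Eigenvector of R associated with its smallest eigenvalue."""
    _, V = np.linalg.eigh(R)
    return V[:, 0]

def sample_channels(R, n, rng):
    """Rows are realizations of h = R^(1/2) h_w (transposed), h_w ~ CN(0, I)."""
    lam, V = np.linalg.eigh(R)
    S = (V * np.sqrt(lam)) @ V.conj().T
    M = R.shape[0]
    hw = (rng.standard_normal((n, M)) + 1j * rng.standard_normal((n, M))) / np.sqrt(2)
    return hw @ S.T

def ergodic_rate(w, q, H, rho):
    """Monte Carlo estimate of (5) for the user whose channel samples are in H."""
    sig = rho * np.abs(H.conj() @ w) ** 2              # rho |h^H w|^2
    itf = rho * np.abs(H.conj() @ q) ** 2              # rho |h^H q|^2
    return float(np.mean(np.log2(1.0 + sig / (1.0 + itf))))

rng = np.random.default_rng(0)
M, P = 4, 100.0
rho = P / 2
R_A, R_B = exp_corr(M, 0.9), exp_corr(M, -0.9)         # hypothetical covariances
H_A = sample_channels(R_A, 20_000, rng)
w_ge, q_ge = ge_precoder(R_A, R_B), ge_precoder(R_B, R_A)
w_we, q_we = weakest_eigvec(R_B), weakest_eigvec(R_A)
rate_ge = ergodic_rate(w_ge, q_ge, H_A, rho)
rate_we = ergodic_rate(w_we, q_we, H_A, rho)
```

The same function applied to samples of user B's channel, with the roles of the two precoders swapped, gives $R_B$ and hence the sum-rate.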

In spatially correlated channels, the role of spatial
statistics on the rate performance of a linear beamformer has
been well studied in [3]. They consider a MISO broadcast
setting with $M$ antennas at the transmitter and $M$ single-antenna
users. When $M = 2$, optimal precoders that maximize
the ergodic sum-rate were developed at extremely high/low
SNR. Meanwhile, the best precoding vectors are still unknown
for the general case ($M > 2$) due to the difficulty of computing
a closed-form expression for the ergodic sum-rate. In this
paper, we show by the following theorem that the generalized
eigenvector (GE) is still valid for the two-user, $M$-transmit-antenna
scenario.

Theorem 1: For any $M$, the ergodic sum-rate of SBF at
high SNR can be lower bounded by

$$R_{sum} \geq \log_2\left(\frac{\mathbf{w}^H\mathbf{R}_A\mathbf{w}\;\mathbf{q}^H\mathbf{R}_B\mathbf{q}}{\mathbf{w}^H\mathbf{R}_B\mathbf{w}\;\mathbf{q}^H\mathbf{R}_A\mathbf{q}}\right) - \frac{2\gamma}{\ln 2}, \qquad (6)$$

where $\gamma$ is the Euler constant. The precoders that maximize
the lower bound in (6) are given by

$$\mathbf{w}_{GE} = \mathbf{u}_{\max}(\mathbf{R}_B^{-1}\mathbf{R}_A), \qquad \mathbf{q}_{GE} = \mathbf{u}_{\max}(\mathbf{R}_A^{-1}\mathbf{R}_B). \qquad (7)$$

The corresponding lower bound of the ergodic sum-rate is

$$R_{sum,lb} = \log_2\left(\chi(\mathbf{R}_B^{-1}\mathbf{R}_A)\right) - \frac{2\gamma}{\ln 2} = \log_2\left(\chi(\mathbf{R}_A^{-1}\mathbf{R}_B)\right) - \frac{2\gamma}{\ln 2}, \qquad (8)$$

where $\chi(\cdot) = \frac{\lambda_{\max}(\cdot)}{\lambda_{\min}(\cdot)}$ is the condition number.

Proof: A detailed proof is relegated to Appendix B. ■

A special case³ of (8) is confirmed by [3, Corollary 2]
by noting that $\chi(\cdot) \geq 1$. In the low SNR region where the
interference can be completely ignored, the optimal choice
is to send along the dominant statistical eigen-mode of the
user's own channel. At intermediate SNR, however, prior work
has shown the difficulty of finding a closed-form expression
of the optimal precoders even for the $M = 2$ case. Instead, it
is solved by an exhaustive search operated upon a linearly
combined high- and low-SNR solution. In the general $M \geq 2$
case, we compute only a high-SNR solution and avoid the
line search method. The simulation results show that it works
well at practical SNR.

³The users share the same set of statistical eigenmodes but have
orthogonal dominant eigenvectors, and $M = 2$.

Remark 1: The closed-form precoders that maximize the
ergodic sum-rate of SBF are difficult to compute due to the
coupled nature of the SINR expression. To solve this problem,
we can use an alternative SLNR metric, which is defined as
$\mathrm{SLNR}_A = \frac{\rho|\mathbf{h}^H\mathbf{w}|^2}{1 + \rho|\mathbf{g}^H\mathbf{w}|^2}$. At high SNR, the maximization of
a lower bound on $E[\mathrm{SLNR}_A]$ also leads to the solution (7).
Similarly, the effectiveness of the SLNR metric in designing
multi-user transmit beamforming vectors has been examined
in the literature.

In contrast with SGEBF, the precoding vectors of
SWEBF in correlated channels can be written as $\mathbf{w}_{WE} = \mathbf{u}_{\min}(\mathbf{R}_B)$,
$\mathbf{q}_{WE} = \mathbf{u}_{\min}(\mathbf{R}_A)$. However, the rate performance
of SWEBF is unfavourable in the scenario where the channels
of users A and B have similar weakest eigen-directions (e.g.,
co-located users). To be specific, the precoding vectors which
are designed to remove the interference also cancel out the
intended signal. By contrast, the GE beamforming approach
obtains a balance between interference cancellation and desired
signal enhancement. In other words, SGEBF exhibits
robustness with respect to different channels compared to
SWEBF. Moreover, we can observe from the numerical results
that SGEBF outperforms SWEBF further as $M$ increases.


IV. ALTERNATIVE MAT

Let us focus on the scenario with both delayed and statistical
CSIT. Under equal power allocation, prior work has shown that
additional channel statistics enable a higher achievable sum-rate
relative to the original MAT. However, there are major
differences between this section and that work. First, that work
relaxed the power constraint in the interference retransmission phase
(stage II), which leads to a variation of the total transmit power.
We here control the power consumption by using a long-term
power constraint. Second, an efficient iterative algorithm is
developed to compute the statistical precoders to maximize an
approximation of the ergodic sum-rate. In particular, monotonic
convergence of the algorithm is proved. This section reveals
how much the ergodic sum-rate can be improved by statistical
precoding only. It will be used as a baseline for the next section
where the rate performance is further enhanced by additional
power allocation.


A. Rate Approximation 

The proposed SAMAT scheme becomes statistical CSIT-aided
AMAT by simply letting $P_6 = P_7 = P_9 = P_{10} = 0$,
$P_5 = P_8 = 1$ in (2), and the transmitter distributes equal
power, denoted as $\rho$, to the four symbols. The transmitted signals
can be expressed as

$$\mathbf{x}_1 = \sqrt{\rho}\,\mathbf{W}\mathbf{s}_A + \sqrt{\rho}\,\mathbf{Q}\mathbf{s}_B$$
$$\mathbf{x}_2 = \sqrt{\rho}\,[\mathbf{h}_1^H\mathbf{Q}\mathbf{s}_B, \mathbf{0}]^T \qquad (9)$$
$$\mathbf{x}_3 = \sqrt{\rho}\,[\mathbf{g}_1^H\mathbf{W}\mathbf{s}_A, \mathbf{0}]^T.$$

With delayed CSIT only, $\mathbf{W} = \mathbf{Q} = \mathbf{I}$ and the proposed
SAMAT scheme further boils down to the original AMAT
scheme. For simplicity, we focus on the performance of user
A and similar results can be symmetrically applied to user B.
The signal vector received by user A is given by


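$$\mathbf{y}_A = \sqrt{\rho}\begin{bmatrix}\mathbf{h}_1^H\mathbf{W}\\ \mathbf{0}\\ h_{31}\mathbf{g}_1^H\mathbf{W}\end{bmatrix}\mathbf{s}_A + \sqrt{\rho}\begin{bmatrix}\mathbf{h}_1^H\mathbf{Q}\\ h_{21}\mathbf{h}_1^H\mathbf{Q}\\ \mathbf{0}\end{bmatrix}\mathbf{s}_B + \begin{bmatrix}n_{A1}\\ n_{A2}\\ n_{A3}\end{bmatrix}, \qquad (10)$$

where $\mathbf{y}_A = [y_{A1}, y_{A2}, y_{A3}]^T$ denotes the received signals
over three time slots and $h_{jm}$ denotes the channel coefficient
between the $m$-th transmit antenna and user A in time slot $j$.
$n_{Aj} \sim \mathcal{CN}(0, 1)$ is the normalized complex AWGN. After
further interference elimination,

$$\tilde{\mathbf{y}}_A = \sqrt{\rho}\begin{bmatrix}h_{21}\mathbf{h}_1^H\mathbf{W}\\ h_{31}\mathbf{g}_1^H\mathbf{W}\end{bmatrix}\mathbf{s}_A + \begin{bmatrix}h_{21}n_{A1} - n_{A2}\\ n_{A3}\end{bmatrix}, \qquad (11)$$

where $\mathbf{H} = [(h_{21}\mathbf{h}_1^H\mathbf{W})^T, (h_{31}\mathbf{g}_1^H\mathbf{W})^T]^T$ denotes the
effective channel matrix in (11). By using a Minimum Mean-Square Error
(MMSE) receiver with Successive Interference Cancellation
(SIC), the ergodic rate achieved per slot by user A is written
as:

$$R_A = \frac{1}{3}E\left[\log_2\det\left(\mathbf{I} + \rho\,\mathbf{H}^H\mathbf{K}^{-1}\mathbf{H}\right)\right], \qquad (12)$$

where $\mathbf{K}$ is the covariance matrix of the noise vector in (11)
and given by $\mathrm{diag}(1 + |h_{21}|^2, 1)$. It is challenging to obtain
the closed-form expression of the ergodic rate, especially for the
$M > 2$ case. Hence, we optimize the linear beamforming
vectors based on an analytical approximation of $R_A$, which is
given by the following proposition.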
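Before the approximation is stated, the interference elimination step that leads from (10) to (11) can be checked numerically. The sketch below is an illustration under assumptions made here (noiseless reception, $M = 3$, truncated-identity precoders, and $h_{j1}$ taken as the scalar that multiplies the first transmit antenna in $y_j = \mathbf{h}_j^H\mathbf{x}_j$); it is not the authors' simulation code.

```python
import numpy as np

rng = np.random.default_rng(1)

def cn(*shape):
    """i.i.d. CN(0, 1) samples."""
    return (rng.standard_normal(shape) + 1j * rng.standard_normal(shape)) / np.sqrt(2)

M, rho = 3, 10.0
W = np.eye(M, 2, dtype=complex)      # any M x 2 precoders would do for this check
Q = np.eye(M, 2, dtype=complex)
h1, h2, h3, g1 = cn(M), cn(M), cn(M), cn(M)   # user A in slots 1-3, user B in slot 1
s_A, s_B = cn(2), cn(2)

# Transmit signals of (9); stage II uses the first antenna only.
x1 = np.sqrt(rho) * (W @ s_A + Q @ s_B)
x2 = np.zeros(M, dtype=complex)
x2[0] = np.sqrt(rho) * (h1.conj() @ Q @ s_B)   # retransmitted interference seen by A
x3 = np.zeros(M, dtype=complex)
x3[0] = np.sqrt(rho) * (g1.conj() @ W @ s_A)   # retransmitted interference seen by B

# Noiseless observations at user A, y_j = h_j^H x_j.
y1, y2, y3 = h1.conj() @ x1, h2.conj() @ x2, h3.conj() @ x3
h21, h31 = h2[0].conj(), h3[0].conj()

# Interference elimination: h21 * y1 - y2 no longer contains s_B.
y_tilde = np.array([h21 * y1 - y2, y3])
H_eff = np.vstack([h21 * (h1.conj() @ W), h31 * (g1.conj() @ W)])
s_hat = np.linalg.solve(np.sqrt(rho) * H_eff, y_tilde)
```

Without noise, `s_hat` coincides with `s_A`: user A obtains two independent equations in its two symbols, which is the mechanism behind the two symbols delivered to each user over three slots.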

Proposition 1: In spatially correlated Rayleigh fading channels,
the ergodic rate of user A for AMAT can be approximated
as

$$R_A \approx \frac{2}{3}\log_2\left(1 + \rho\sqrt{e^{a}\,\Theta_A}\right), \qquad (13)$$

where

$$\Theta_A = \mathrm{Tr}(\mathbf{W}^H\mathbf{R}_A\mathbf{W})\,\mathrm{Tr}(\mathbf{W}^H\mathbf{R}_B\mathbf{W}) - \mathrm{Tr}(\mathbf{W}^H\mathbf{R}_A\mathbf{W}\mathbf{W}^H\mathbf{R}_B\mathbf{W}) \qquad (14)$$

and $a = e\,\mathrm{Ei}(-1) - 2\gamma$, where $\mathrm{Ei}(x) = -\int_{-x}^{\infty}\frac{e^{-t}}{t}\,dt$ is the
exponential integral.

Proof: See Appendix C. ■

TABLE I: Algorithm 1 & 2: Precoder Optimization for AMAT

1: Initialize: Set iteration index $m = 0$, randomly generate
   $\mathbf{w}_1^{(0)}, \mathbf{w}_2^{(0)}$ and then normalize
2: Repeat
3:   $m \leftarrow m + 1$
4:   Update $\mathbf{w}_1^{(m)}$ with GradAct [Algorithm 1], or
     with Max-Eig [Algorithm 2]
5:   Update $\mathbf{w}_2^{(m)}$ with GradAct or with Max-Eig
6: Until $|\Theta_A^{(m)} - \Theta_A^{(m-1)}| < \epsilon$

Then, we can obtain that $R_{sum} = R_A + R_B \approx
\frac{2}{3}\log_2\left(1 + \rho\sqrt{e^{a}\Theta_A}\right) + \frac{2}{3}\log_2\left(1 + \rho\sqrt{e^{a}\Theta_B}\right)$, where
$\Theta_B = \mathrm{Tr}(\mathbf{Q}^H\mathbf{R}_A\mathbf{Q})\,\mathrm{Tr}(\mathbf{Q}^H\mathbf{R}_B\mathbf{Q}) - \mathrm{Tr}(\mathbf{Q}^H\mathbf{R}_A\mathbf{Q}\mathbf{Q}^H\mathbf{R}_B\mathbf{Q})$.
It shows that the AMAT scheme exploiting delayed CSIT
enables a DoF of $\frac{4}{3}$ at high SNR, while the beamforming
based on statistical CSIT makes no contribution to the DoF
gain. However, the ergodic rate performance at practical SNR
benefits from such spatial correlation information. Observe in
(13) and (14) that the ergodic rate relies on the precoders and
the spatial correlation matrices. The latter are fixed
in a given environment. Therefore, $R_A$ and $R_B$ only depend
on $\Theta_A(\mathbf{W})$ and $\Theta_B(\mathbf{Q})$, respectively. To maximize the
ergodic sum-rate performance, the precoders $\mathbf{W}$ and $\mathbf{Q}$ can
be independently designed. Let us focus on $\Theta_A$ only and
optimize $\mathbf{W}$. Similarly, we can obtain the optimal $\mathbf{Q}$ that
maximizes $\Theta_B$.


B. Precoder Design

1) Multi-antenna case ($M > 2$): It is difficult to obtain
a closed-form expression of the beamforming vectors that
maximize (14) and further (13). For such a problem where
joint optimization is difficult but the objective function is
convex in each of the optimization variables $\mathbf{w}_1$ and $\mathbf{w}_2$, an
alternating algorithm, also known as the Block Coordinate Descent
method, has been widely used in optimization.
More specifically, we maximize (14) by sequentially fixing one
vector and updating the other. Fix $\mathbf{w}_2$ and focus on $\mathbf{w}_1$ (vice
versa, the following derivations still hold). We can reformulate
the subproblem as

$$\max_{\mathbf{w}_1}\ \Theta_A(\mathbf{w}_1) = \mathbf{w}_1^H\mathbf{R}_A\mathbf{w}_1\mathbf{w}_2^H\mathbf{R}_B\mathbf{w}_2 + \mathbf{w}_1^H\mathbf{R}_B\mathbf{w}_1\mathbf{w}_2^H\mathbf{R}_A\mathbf{w}_2 - \mathbf{w}_1^H\mathbf{R}_A\mathbf{w}_2\mathbf{w}_2^H\mathbf{R}_B\mathbf{w}_1 - \mathbf{w}_1^H\mathbf{R}_B\mathbf{w}_2\mathbf{w}_2^H\mathbf{R}_A\mathbf{w}_1$$
$$\text{s.t.}\ \|\mathbf{w}_1\| = 1. \qquad (15)$$

Since it is convex in $\mathbf{w}_1$⁴, the classical gradient ascent
(GradAct) method can be used to determine the optimal
solution (step 4 of Table I). Once the optimal $\mathbf{w}_1$ is obtained
for a given $\mathbf{w}_2$, the process is repeated the other
way around (step 5), leading to an iterative algorithm. Since
the steepest ascent direction acts as the best direction to
increase the objective function, a proper step size can be
computed for a non-decreasing objective value, i.e.,
$\Theta_A^{(m,4)} \leq \Theta_A^{(m,5)} \leq \Theta_A^{(m+1,4)}$, where $\Theta_A^{(m,4)}$ refers to the objective
value at step 4 in the $m$-th iteration in Table I (Algorithm
1). Thus, the convergence of Algorithm 1 is ensured, since
$\Theta_A$ is monotonically increased (non-decreased) after each
iteration and upper bounded. Even though the optimal solution
is obtained for each subproblem, the iterative algorithm cannot
guarantee globally optimal beamforming vectors.

⁴The convexity can be easily proved with the second order condition, which
is omitted here for conciseness.

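Alternatively, (15) is quadratic in $\mathbf{w}_1$ and the optimal
solution can be obtained by eigen-decomposition.
Rewrite (15) as $\Theta_A(\mathbf{w}_1) = \mathbf{w}_1^H\mathbf{M}(\mathbf{w}_2)\mathbf{w}_1$, where
$\mathbf{M}(\mathbf{w}_2) = \mathbf{w}_2^H\mathbf{R}_B\mathbf{w}_2\,\mathbf{R}_A + \mathbf{w}_2^H\mathbf{R}_A\mathbf{w}_2\,\mathbf{R}_B - \mathbf{R}_A\mathbf{w}_2\mathbf{w}_2^H\mathbf{R}_B - \mathbf{R}_B\mathbf{w}_2\mathbf{w}_2^H\mathbf{R}_A$.
The closed-form solution is the maximum
eigenvector (Max-Eig),

$$\mathbf{w}_1^{\star} = \arg\max_{\|\mathbf{w}_1\| = 1}\Theta_A(\mathbf{w}_1) = \mathbf{u}_{\max}(\mathbf{M}(\mathbf{w}_2)). \qquad (16)$$

With (16) at hand, we can easily compute the optimal
precoders by the proposed iterative approach. In the accompanying
figure, we show by two cases ($M = 4, 8$) that the iterative algorithm
converges very fast, where the covariance matrices are randomly
generated.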
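The alternating Max-Eig update in (16) is short enough to sketch directly. The code below is a minimal illustration rather than the authors' implementation: the function names, the stopping tolerance, the random initialization and the example covariance matrices are assumptions made here. It alternates the updates of $\mathbf{w}_1$ and $\mathbf{w}_2$ and records $\Theta_A$ after every sweep so that the monotonic behaviour can be inspected.

```python
import numpy as np

def theta(W, R_A, R_B):
    """Objective (14): Tr(A) Tr(B) - Tr(A B) with A = W^H R_A W, B = W^H R_B W."""
    A = W.conj().T @ R_A @ W
    B = W.conj().T @ R_B @ W
    return float(np.real(np.trace(A) * np.trace(B) - np.trace(A @ B)))

def M_of(w, R_A, R_B):
    """Hermitian matrix M(w) such that Theta = v^H M(w) v for the other precoder v."""
    a = np.real(w.conj() @ R_A @ w)
    b = np.real(w.conj() @ R_B @ w)
    Aw, Bw = R_A @ w, R_B @ w
    return b * R_A + a * R_B - np.outer(Aw, Bw.conj()) - np.outer(Bw, Aw.conj())

def max_eig_update(w_fixed, R_A, R_B):
    """Max-Eig step (16): dominant unit-norm eigenvector of M(w_fixed)."""
    _, V = np.linalg.eigh(M_of(w_fixed, R_A, R_B))
    return V[:, -1]

def optimize_precoder(R_A, R_B, rng, tol=1e-9, max_iter=200):
    """Alternate the updates of w_1 and w_2 until Theta stops increasing."""
    M = R_A.shape[0]
    W = rng.standard_normal((M, 2)) + 1j * rng.standard_normal((M, 2))
    W /= np.linalg.norm(W, axis=0)              # unit-norm random initialization
    history = [theta(W, R_A, R_B)]
    for _ in range(max_iter):
        W[:, 0] = max_eig_update(W[:, 1], R_A, R_B)
        W[:, 1] = max_eig_update(W[:, 0], R_A, R_B)
        history.append(theta(W, R_A, R_B))
        if history[-1] - history[-2] < tol:
            break
    return W, history

idx = np.arange(4)
R_A = 0.9 ** np.abs(idx[:, None] - idx[None, :])      # hypothetical covariances
R_B = (-0.7) ** np.abs(idx[:, None] - idx[None, :])
W_opt, hist = optimize_precoder(R_A, R_B, np.random.default_rng(3))
```

Each half-step solves its subproblem exactly, so the recorded values cannot decrease, and the two returned columns come out mutually orthogonal because $\mathbf{M}(\mathbf{w})\mathbf{w} = \mathbf{0}$.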

2) Two-antenna case ($M = 2$): A special case of considerable
interest is the two-transmit-antenna scenario. The optimal
precoders can be easily obtained by the following proposition.

Proposition 2: For the $M = 2$ MISO BC with spatially
correlated Rayleigh fading, any unitary beamforming matrix
is optimal to maximize (14) and further (13).

Proof: For arbitrary $M$ and $\mathbf{w}_2$, it is easy to verify
that $\mathbf{w}_2^H\mathbf{M}(\mathbf{w}_2)\mathbf{w}_1 = 0$, i.e., $\mathbf{M}(\mathbf{w}_2)\mathbf{w}_1 \in N(\mathbf{w}_2)$. The
maximization of $\Theta_A(\mathbf{w}_1) = \mathbf{w}_1^H\mathbf{M}(\mathbf{w}_2)\mathbf{w}_1$ leads to the
observation that the optimum $\mathbf{w}_1 \in N(\mathbf{w}_2)$. Similarly, when
we fix $\mathbf{w}_1$ and update $\mathbf{w}_2$, we have the optimum $\mathbf{w}_2 \in N(\mathbf{w}_1)$.
It implies that the optimal beamforming vectors are always
orthogonally chosen ($\mathbf{w}_1 \perp \mathbf{w}_2$). For the special $M = 2$
case, since $\mathbf{w}_1$ is uniquely defined in $N(\mathbf{w}_2)$ and vice versa,
any two beamforming vectors constituting a unitary matrix
are optimal. Moreover, eq. (14) becomes the constant $\Theta_A =
\mathrm{Tr}(\mathbf{R}_A)\mathrm{Tr}(\mathbf{R}_B) - \mathrm{Tr}(\mathbf{R}_A\mathbf{R}_B)$. ■
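The invariance stated in Proposition 2 is easy to check numerically, and the same few lines can evaluate the approximation (13). The sketch below is illustrative: the $2 \times 2$ covariance matrices, the helper names and the SNR value are assumptions made here, and the constant $a = e\,\mathrm{Ei}(-1) - 2\gamma$ is computed from the standard series $E_1(1) = -\gamma - \sum_{k \geq 1}(-1)^k/(k \cdot k!)$ with $\mathrm{Ei}(-1) = -E_1(1)$.

```python
import math
import numpy as np

def theta(W, R_A, R_B):
    """Objective (14) for a precoding matrix W."""
    A = W.conj().T @ R_A @ W
    B = W.conj().T @ R_B @ W
    return float(np.real(np.trace(A) * np.trace(B) - np.trace(A @ B)))

def random_unitary(M, rng):
    """Random unitary matrix from the QR factorization of a complex Gaussian matrix."""
    Z = rng.standard_normal((M, M)) + 1j * rng.standard_normal((M, M))
    Qm, _ = np.linalg.qr(Z)
    return Qm

def a_constant():
    """a = e * Ei(-1) - 2 * gamma, using the power series of E1 at 1."""
    e1 = -np.euler_gamma - sum((-1) ** k / (k * math.factorial(k)) for k in range(1, 30))
    return -math.e * e1 - 2 * np.euler_gamma

def amat_rate_approx(theta_val, rho):
    """Approximation (13) of the per-slot ergodic rate of one user."""
    return (2.0 / 3.0) * math.log2(1.0 + rho * math.sqrt(math.exp(a_constant()) * theta_val))

rng = np.random.default_rng(2)
R_A = np.array([[1.0, 0.8], [0.8, 1.0]])            # hypothetical covariances, Tr = 2
R_B = np.array([[1.0, -0.5j], [0.5j, 1.0]])
closed_form = float(np.real(np.trace(R_A) * np.trace(R_B) - np.trace(R_A @ R_B)))
thetas = [theta(random_unitary(2, rng), R_A, R_B) for _ in range(5)]
rate = amat_rate_approx(closed_form, rho=10.0)
```

Every entry of `thetas` matches `closed_form`, since for $M = 2$ a unitary $\mathbf{W}$ only rotates the pair of effective channels without changing the magnitude of their determinant.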

This proposition reveals that any orthogonal beamforming
vectors with equal power allocation achieve the same ergodic
sum-rate performance, which is verified by Fig. 4(b) in
Section VI. Then, let us compute the equal power allocation $\rho$.
According to (3), the long-term average power consumption
for AMAT is represented by

$$P_c = 4\rho + \rho\,\mathrm{Tr}(\mathbf{Q}^H\mathbf{R}_A\mathbf{Q}) + \rho\,\mathrm{Tr}(\mathbf{W}^H\mathbf{R}_B\mathbf{W})$$
$$\quad = \rho\left(4 + \mathbf{q}_1^H\mathbf{R}_A\mathbf{q}_1 + \mathbf{q}_2^H\mathbf{R}_A\mathbf{q}_2 + \mathbf{w}_1^H\mathbf{R}_B\mathbf{w}_1 + \mathbf{w}_2^H\mathbf{R}_B\mathbf{w}_2\right)$$
$$\quad \leq \rho\,(4 + 2M), \qquad (17)$$

where the inequality in (17) is obtained by using $\mathbf{x}_1^H\mathbf{R}\mathbf{x}_1 +
\mathbf{x}_2^H\mathbf{R}\mathbf{x}_2 \leq \lambda_1(\mathbf{R}) + \lambda_2(\mathbf{R}) \leq \mathrm{Tr}(\mathbf{R})$, where the unit-norm $\mathbf{x}_i$
are mutually orthogonal and $\lambda_i(\mathbf{R})$ corresponds to the $i$-th
largest eigenvalue. In order to maintain the power
constraint, equal power allocation is calculated as $\rho = \frac{3P}{4 + 2M}$
(e.g., $\rho = \frac{3P}{8}$ for two transmit antennas). Equality in (17)
holds for the $M = 2$ case, which also justifies Proposition 2 in
the sense that orthonormal precoders (optimally) use up all the
power budget.
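The equal power allocation and the bound behind it can be verified with a few lines. The sketch below is an illustration under assumptions made here (an exponential correlation model normalized to $\mathrm{Tr}(\mathbf{R}) = M$, random orthonormal precoders, hypothetical function names); it computes $\rho = 3P/(4 + 2M)$ and the resulting long-term power, with the retransmitted terms weighted as in (3).

```python
import numpy as np

def equal_power(P, M):
    """Equal per-symbol power rho = 3P / (4 + 2M) for statistical CSIT-aided AMAT."""
    return 3.0 * P / (4 + 2 * M)

def amat_power(rho, W, Q, R_A, R_B):
    """Long-term power of equal-power AMAT: four symbols plus two retransmissions."""
    relayed_A = np.real(np.trace(Q.conj().T @ R_A @ Q))   # power of eta_A per unit rho
    relayed_B = np.real(np.trace(W.conj().T @ R_B @ W))   # power of eta_B per unit rho
    return rho * (4 + relayed_A + relayed_B)

def random_orthonormal(M, rng):
    """M x 2 matrix with orthonormal columns (reduced QR of a complex Gaussian matrix)."""
    Z = rng.standard_normal((M, 2)) + 1j * rng.standard_normal((M, 2))
    Qm, _ = np.linalg.qr(Z)
    return Qm

def exp_corr(M, t):
    """Real exponential correlation model with Tr(R) = M (illustrative assumption)."""
    idx = np.arange(M)
    return t ** np.abs(idx[:, None] - idx[None, :])

rng = np.random.default_rng(4)
P = 1.0
results = {}
for M in (2, 4, 8):
    R_A, R_B = exp_corr(M, 0.9), exp_corr(M, -0.6)
    W, Q = random_orthonormal(M, rng), random_orthonormal(M, rng)
    results[M] = amat_power(equal_power(P, M), W, Q, R_A, R_B)
```

For $M = 2$ the stored value equals $3P$, while for larger $M$ it stays below $3P$ because two orthonormal precoders capture only part of the trace of each covariance matrix.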


V. STATISTICAL AMAT 

In the last section, we explored how statistical CSIT aids
AMAT under equal power allocation. The corresponding
rate performance is superior to the original AMAT but
still inferior to SBF in highly correlated channels in the
low/intermediate SNR regime. In fact, the ergodic sum-rate
can be further enhanced by power allocation optimization. The
proposed SAMAT scheme in Fig. 1 is developed to outperform
both SBF and AMAT at any SNR under arbitrary spatial correlation.
Proper power allocation and statistical precoder design can
compress the received interference $\eta_A$ ($\eta_B$) in the 1st slot (as well
as the common symbols in the subsequent slots). Meanwhile, it makes
room for new symbol transmission in stage II, which brings
more rate benefits. To achieve this, the power allocation is
optimized based on a tractable approximation of the ergodic
sum-rate of SAMAT. The power allocation strategy that maximizes
the rate performance depends on the SNR, the precoder design
as well as the spatial correlation.

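Hereafter, we focus on user A; similar results can be derived for user B. The received signal of user A can be written as

$$\mathbf{y}_A = \mathbf{H}_1 \mathbf{P}_a^{1/2} \mathbf{s}_A + \mathbf{H}_2 \mathbf{P}_b^{1/2} \mathbf{s}_B + \mathbf{H}_3 \mathbf{s}_A^e + \mathbf{H}_4 \mathbf{s}_B^e + \mathbf{n}_A, \qquad (18)$$

where

$$\mathbf{H}_1 = \begin{bmatrix} \mathbf{h}_1^H \mathbf{W} \\ \mathbf{0} \\ \sqrt{P_8}\, h_{31}^* \mathbf{g}_1^H \mathbf{W} \end{bmatrix}, \quad \mathbf{H}_3 = \begin{bmatrix} 0 & 0 \\ \sqrt{P_6}\, \mathbf{h}_2^H \mathbf{w}_3 & 0 \\ 0 & \sqrt{P_9}\, \mathbf{h}_3^H \mathbf{w}_3 \end{bmatrix}, \qquad (19)$$

$$\mathbf{H}_2 = \begin{bmatrix} \mathbf{h}_1^H \mathbf{Q} \\ \sqrt{P_5}\, h_{21}^* \mathbf{h}_1^H \mathbf{Q} \\ \mathbf{0} \end{bmatrix}, \quad \mathbf{H}_4 = \begin{bmatrix} 0 & 0 \\ \sqrt{P_7}\, \mathbf{h}_2^H \mathbf{q}_3 & 0 \\ 0 & \sqrt{P_{10}}\, \mathbf{h}_3^H \mathbf{q}_3 \end{bmatrix}, \qquad (20)$$

and $\mathbf{y}_A = [y_{A1}, y_{A2}, y_{A3}]^T$, $\mathbf{s}_A^e = [s_{A3}, s_{A4}]^T$, $\mathbf{s}_B^e = [s_{B3}, s_{B4}]^T$, $\mathbf{n}_A = [n_{A1}, n_{A2}, n_{A3}]^T$ with $n_{Aj} \sim \mathcal{CN}(0, 1)$. The decoding procedure, which mainly uses interference alignment and cancellation (similar to |@1), is described as follows. Denote $\tilde{\mathbf{y}}_A$ as the received signal after subtracting $\sqrt{P_5}\, h_{21}^* y_{A1}$ from $y_{A2}$ and retaining $y_{A1}$ (see the footnote at the end of this derivation). First, decode the private symbols ($\mathbf{s}_A$) by regarding the extra symbols ($\mathbf{s}_A^e$, $\mathbf{s}_B^e$) as interference:

$$\tilde{\mathbf{y}}_A = \mathbf{H}_1 \mathbf{P}_a^{1/2} \mathbf{s}_A + \mathbf{H}_2 \mathbf{P}_b^{1/2} \mathbf{s}_B + \mathbf{H}_3 \mathbf{s}_A^e + \mathbf{H}_4 \mathbf{s}_B^e + \tilde{\mathbf{n}}_A, \qquad (21)$$

where now $\mathbf{H}_1 = [(\mathbf{h}_1^H \mathbf{W})^T, (-\sqrt{P_5}\, h_{21}^* \mathbf{h}_1^H \mathbf{W})^T, (\sqrt{P_8}\, h_{31}^* \mathbf{g}_1^H \mathbf{W})^T]^T$, $\mathbf{H}_2 = [(\mathbf{h}_1^H \mathbf{Q})^T, \mathbf{0}^T, \mathbf{0}^T]^T$ and $\tilde{\mathbf{n}}_A = [n_{A1}, n_{A2} - \sqrt{P_5}\, h_{21}^* n_{A1}, n_{A3}]^T$. $\mathbf{K}$ is the covariance matrix of the interference plus noise vector $\mathbf{z}$. It is given by $\mathrm{diag}(k_1, k_2, k_3)$, where $k_1 = 1 + \|\mathbf{h}_1^H \mathbf{Q} \mathbf{P}_b^{1/2}\|^2$, $k_2 = 1 + P_5 |h_{21}|^2 + P_6 |\mathbf{h}_2^H \mathbf{w}_3|^2 + P_7 |\mathbf{h}_2^H \mathbf{q}_3|^2$ and $k_3 = 1 + P_9 |\mathbf{h}_3^H \mathbf{w}_3|^2 + P_{10} |\mathbf{h}_3^H \mathbf{q}_3|^2$. To resolve $\mathbf{s}_A$, an MMSE-SIC receiver is applied on (21) and the ergodic sum-rate of $\mathbf{s}_A$ can then be written as

$$R_{s_A} = E\left[\log_2 \det\left(\mathbf{I} + \mathbf{P}_a^{1/2} \mathbf{H}_1^H \mathbf{K}^{-1} \mathbf{H}_1 \mathbf{P}_a^{1/2}\right)\right]. \qquad (22)$$

Once $\mathbf{s}_A$ is obtained, we can subtract it from $\tilde{\mathbf{y}}_A$. Then, we can decode the extra symbols by taking the second and third entries of $\tilde{\mathbf{y}}_A$ as $\hat{\mathbf{y}}_A$:

$$\hat{\mathbf{y}}_A = \mathbf{H}_3 \mathbf{s}_A^e + \mathbf{H}_4 \mathbf{s}_B^e + \hat{\mathbf{n}}_A, \qquad (23)$$

where $\mathbf{H}_3 = \mathrm{diag}(\sqrt{P_6}\, \mathbf{h}_2^H \mathbf{w}_3, \sqrt{P_9}\, \mathbf{h}_3^H \mathbf{w}_3)$, $\mathbf{H}_4 = \mathrm{diag}(\sqrt{P_7}\, \mathbf{h}_2^H \mathbf{q}_3, \sqrt{P_{10}}\, \mathbf{h}_3^H \mathbf{q}_3)$ and $\hat{\mathbf{n}}_A = [n_{A2} - \sqrt{P_5}\, h_{21}^* n_{A1}, n_{A3}]^T$. The covariance matrix of $\hat{\mathbf{n}}_A$ is given by $\mathbf{N} = \mathrm{diag}(1 + P_5 |h_{21}|^2, 1)$. The ergodic sum-rate of $\mathbf{s}_A^e$ is given by

$$R_{s_A^e} = E\left[\log_2 \det\left(\mathbf{I} + \mathbf{H}_3^H (\mathbf{N} + \mathbf{H}_4 \mathbf{H}_4^H)^{-1} \mathbf{H}_3\right)\right]. \qquad (24)$$

It is challenging to obtain a closed-form expression for the ergodic rate. Instead, we derive a tractable approximation and optimize the power allocation based on this approximation.

Proposition 3: The achievable ergodic sum-rate per slot at user A with linear beamforming can be approximated as $R_A = \frac{1}{3}(R_{s_A} + R_{s_A^e})$, where

$$R_{s_A} \approx \log_2\left(1 + \delta_{A1}(\Gamma_{A1} P_1 + \Gamma_{A2} P_2) + \delta_{A2}(\Lambda_{B1} P_1 + \Lambda_{B2} P_2) + \delta_{A1} \delta_{A2} \Omega_A P_1 P_2\right), \qquad (25)$$

$$R_{s_A^e} \approx \log_2\left(1 + \frac{\Gamma_{A3} P_6}{1 + P_5 + \Lambda_{A3} P_7}\right) + \log_2\left(1 + \frac{\Gamma_{A3} P_9}{1 + \Lambda_{A3} P_{10}}\right). \qquad (26)$$

Similarly, we have $R_B = \frac{1}{3}(R_{s_B} + R_{s_B^e})$ and

$$R_{s_B} \approx \log_2\left(1 + \delta_{B1}(\Gamma_{B1} P_3 + \Gamma_{B2} P_4) + \delta_{B2}(\Lambda_{A1} P_3 + \Lambda_{A2} P_4) + \delta_{B1} \delta_{B2} \Omega_B P_3 P_4\right), \qquad (27)$$

$$R_{s_B^e} \approx \log_2\left(1 + \frac{\Gamma_{B3} P_7}{1 + \Lambda_{B3} P_6}\right) + \log_2\left(1 + \frac{\Gamma_{B3} P_{10}}{1 + P_8 + \Lambda_{B3} P_9}\right), \qquad (28)$$

where

$$\delta_{A1} = \frac{1}{1 + \Lambda_{A1} P_3 + \Lambda_{A2} P_4} + \frac{P_5}{1 + P_5 + \Gamma_{A3} P_6 + \Lambda_{A3} P_7}, \quad \delta_{A2} = \frac{P_8}{1 + \Gamma_{A3} P_9 + \Lambda_{A3} P_{10}},$$

$$\delta_{B1} = \frac{1}{1 + \Lambda_{B1} P_1 + \Lambda_{B2} P_2} + \frac{P_8}{1 + P_8 + \Lambda_{B3} P_9 + \Gamma_{B3} P_{10}}, \quad \delta_{B2} = \frac{P_5}{1 + \Lambda_{B3} P_6 + \Gamma_{B3} P_7}, \qquad (29)$$

$$\Lambda_{A1} = \mathbf{q}_1^H \mathbf{R}_A \mathbf{q}_1, \ \Lambda_{A2} = \mathbf{q}_2^H \mathbf{R}_A \mathbf{q}_2, \ \Lambda_{B1} = \mathbf{w}_1^H \mathbf{R}_B \mathbf{w}_1, \ \Lambda_{B2} = \mathbf{w}_2^H \mathbf{R}_B \mathbf{w}_2,$$
$$\Gamma_{A1} = \mathbf{w}_1^H \mathbf{R}_A \mathbf{w}_1, \ \Gamma_{A2} = \mathbf{w}_2^H \mathbf{R}_A \mathbf{w}_2, \ \Gamma_{B1} = \mathbf{q}_1^H \mathbf{R}_B \mathbf{q}_1, \ \Gamma_{B2} = \mathbf{q}_2^H \mathbf{R}_B \mathbf{q}_2,$$
$$\Lambda_{A3} = \mathbf{q}_3^H \mathbf{R}_A \mathbf{q}_3, \ \Lambda_{B3} = \mathbf{w}_3^H \mathbf{R}_B \mathbf{w}_3, \ \Gamma_{A3} = \mathbf{w}_3^H \mathbf{R}_A \mathbf{w}_3, \ \Gamma_{B3} = \mathbf{q}_3^H \mathbf{R}_B \mathbf{q}_3, \qquad (30)$$

$$\Omega_A = \mathrm{Tr}(\mathbf{W}^H \mathbf{R}_A \mathbf{W})\, \mathrm{Tr}(\mathbf{W}^H \mathbf{R}_B \mathbf{W}) - \mathrm{Tr}(\mathbf{W}^H \mathbf{R}_A \mathbf{W} \mathbf{W}^H \mathbf{R}_B \mathbf{W}),$$
$$\Omega_B = \mathrm{Tr}(\mathbf{Q}^H \mathbf{R}_A \mathbf{Q})\, \mathrm{Tr}(\mathbf{Q}^H \mathbf{R}_B \mathbf{Q}) - \mathrm{Tr}(\mathbf{Q}^H \mathbf{R}_A \mathbf{Q} \mathbf{Q}^H \mathbf{R}_B \mathbf{Q}). \qquad (31)$$

Footnote (decoding procedure): (A)MAT-based schemes use one observation to completely remove the overheard interference, and two independent observations remain to resolve two symbols. By contrast, we cancel the overheard interference while maintaining all three observations. The reasons are as follows: 1) In some cases (e.g., highly correlated channels) where SAMAT boils down to SBF in each time slot, symbols need to be decoded slot by slot. However, the conventional decoding strategy causes a rate loss because one observation is dropped. 2) The ergodic rate of the proposed decoding method is slightly better than that of the conventional one, due to one more observation, even with strong interference.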

Proof: Refer to Appendix D. ■
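The interference cancellation step used by the decoder (subtracting a scaled copy of the slot-1 observation from the slot-2 observation) can be illustrated numerically. The sketch below is only an illustration under assumptions: the channels, precoders, power values and the helper `cancelled_observation` are hypothetical, and the slot-2 signal is assumed to carry the overheard interference on the first antenna together with one new symbol per user. It checks that the cancelled observation no longer depends on user B's slot-1 symbols.

```python
import numpy as np

rng = np.random.default_rng(0)
M = 4  # number of transmit antennas (illustrative choice)

def cn(n):
    """Draw n i.i.d. CN(0, 1) samples."""
    return (rng.standard_normal(n) + 1j * rng.standard_normal(n)) / np.sqrt(2)

def unit(v):
    """Normalize a vector to unit norm."""
    return v / np.linalg.norm(v)

# Hypothetical slot-1 / slot-2 channels of user A and random unit-norm precoders.
h1, h2 = cn(M), cn(M)
W = np.stack([unit(cn(M)), unit(cn(M))], axis=1)   # [w1, w2]
Q = np.stack([unit(cn(M)), unit(cn(M))], axis=1)   # [q1, q2]
w3, q3 = unit(cn(M)), unit(cn(M))

# Hypothetical power values P1..P7 and fixed symbols / noise samples.
P1, P2, P3, P4, P5, P6, P7 = 3.0, 2.0, 3.0, 2.0, 1.5, 1.0, 1.0
sA, sA3, sB3 = cn(2), cn(1)[0], cn(1)[0]
n1, n2 = cn(1)[0], cn(1)[0]

def cancelled_observation(sB):
    """Return y_A2 - sqrt(P5) * conj(h21) * y_A1 for given slot-1 symbols sB."""
    desired = h1.conj() @ W @ (np.sqrt([P1, P2]) * sA)      # h1^H W Pa^{1/2} sA
    overheard = h1.conj() @ Q @ (np.sqrt([P3, P4]) * sB)    # h1^H Q Pb^{1/2} sB
    y1 = desired + overheard + n1
    h21c = np.conj(h2[0])                                   # conj of first entry of h2
    y2 = (np.sqrt(P5) * h21c * overheard
          + np.sqrt(P6) * (h2.conj() @ w3) * sA3
          + np.sqrt(P7) * (h2.conj() @ q3) * sB3 + n2)
    return y2 - np.sqrt(P5) * h21c * y1

# Two different realizations of user B's slot-1 symbols give the same result.
r1 = cancelled_observation(cn(2))
r2 = cancelled_observation(cn(2))
print(abs(r1 - r2))
```

The price of the cancellation is visible in the code: the residual noise is `n2 - sqrt(P5) * conj(h21) * n1`, whose variance is 1 + P5 |h21|^2 rather than 1.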

Remark 2: Compared to the interference quantization approach in El, the analog transmission induces a noise enhancement. Namely, interference alignment cancels the overheard interference while scaling up the noise by $P_5$ ($P_8$). This noise enhancement can be observed in (25)-(28). At low SNR, the proposed SAMAT scheme behaves as SBF in each time slot. The scaling factors are small and therefore the effect of noise enhancement is negligible. The gain over AMAT mainly comes from extra symbol transmission and statistical precoding. At high SNR, the proposed SAMAT scheme behaves as AMAT, achieving a DoF of 4/3. In this case, the ergodic rates of the extra symbols can be eliminated by noise enhancement. Namely, there is little benefit in transmitting extra symbols. However, the proposed SAMAT scheme still achieves a significant gain over SBF and AMAT through power allocation optimization and statistical precoding.

With predefined beamforming vectors, the proposed SAMAT scheme softly bridges between SBF and AMAT by power control. Let us concentrate on two cases:

case 1: bridge between SWEBF and AMAT, $\mathbf{w}_1 = \mathbf{u}_{\max}(\mathbf{R}_B)$, $\mathbf{q}_1 = \mathbf{u}_{\max}(\mathbf{R}_A)$, $\mathbf{w}_2 = \mathbf{w}_3 = \mathbf{w}_{WE} = \mathbf{u}_{\min}(\mathbf{R}_B)$, $\mathbf{q}_2 = \mathbf{q}_3 = \mathbf{q}_{WE} = \mathbf{u}_{\min}(\mathbf{R}_A)$;

case 2: bridge between SGEBF and AMAT, $\mathbf{w}_1 = \mathbf{u}_{\min}(\mathbf{R}_B^{-1} \mathbf{R}_A)$, $\mathbf{q}_1 = \mathbf{u}_{\min}(\mathbf{R}_A^{-1} \mathbf{R}_B)$, $\mathbf{w}_2 = \mathbf{w}_3 = \mathbf{w}_{GE} = \mathbf{u}_{\max}(\mathbf{R}_B^{-1} \mathbf{R}_A)$, $\mathbf{q}_2 = \mathbf{q}_3 = \mathbf{q}_{GE} = \mathbf{u}_{\max}(\mathbf{R}_A^{-1} \mathbf{R}_B)$.

Case 1 is used to show the efficacy of the power allocation optimization technique by which the proposed SAMAT scheme can softly bridge between SWEBF and AMAT. Beyond this, case 2 makes better use of statistical CSIT in the sense that SGEBF exhibits higher robustness compared to SWEBF. Instead of using the optimized AMAT precoders in transmission stage I ($\mathbf{w}_1, \mathbf{w}_2, \mathbf{q}_1, \mathbf{q}_2$ as developed in Section IV), we use the precoders above ($\mathbf{w}_1, \mathbf{w}_2, \mathbf{q}_1, \mathbf{q}_2$ as WE/GE precoders). The motivations are as follows. First, the optimal precoders in Section IV that maximize the ergodic sum-rate of AMAT under equal power allocation are not necessarily optimal for SAMAT with power control. Second, SAMAT boils down to SBF at low to intermediate SNR in highly correlated channels, where the optimized AMAT precoders may yield a poorer rate performance than the WE/GE precoders. In order to softly bridge between SBF and AMAT, we adopt the precoder design above.

Consequently, $R_{sum} = R_A + R_B$ and the ergodic sum-rate optimization problem is formulated as:

$$\max_{P_1, \ldots, P_{10}} R_{sum} \quad \text{s.t.} \quad P_c = 3P, \quad P_i \geq 0, \ i = 1, \ldots, 10. \qquad (32)$$


It was shown that the sum-rate optimization generally leads to an intractable NP-hard problem [23]. Thus, an algorithm achieving the global optimum cannot be expected. However, the Sequential Quadratic Programming (SQP) algorithm provides an efficient way to solve nonlinear constrained optimization problems. An overview of SQP is provided in [23]-[26]. Briefly, a quadratic approximation of the Lagrangian function is made by applying a quasi-Newton updating method. The resulting QP subproblem can be optimally solved and its solution is then used as a search direction. With a proper line search, an estimate of the solution is computed for the next iteration. This SQP algorithm can guarantee super-linear convergence to a local optimum (i.e., a local minimum of the negated sum-rate).
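As a rough illustration of the SQP step, the sketch below maximizes the Proposition 3 sum-rate approximation with SciPy's SLSQP solver, which is an SQP-type method and is used here as a stand-in for the Matlab routine. Several ingredients are assumptions rather than the paper's setup: the correlation coefficients, the power budget, the case-1 style eigenvector precoders, and in particular the expression for the total transmit power in `total_power`, which is an assumed form in which the slot-2 and slot-3 retransmissions scale with $P_5$ and $P_8$.

```python
import numpy as np
from scipy.linalg import toeplitz
from scipy.optimize import minimize

M, P_budget = 4, 10.0                      # antennas and per-slot power P (illustrative)
tA, tB = 0.95 * np.exp(1j * 0.4), 0.9 * np.exp(1j * 2.5)   # assumed correlation coefficients
k = np.arange(M)
RA = toeplitz(np.conj(tA) ** k, tA ** k)   # exponential correlation matrices
RB = toeplitz(np.conj(tB) ** k, tB ** k)

def qf(v, R):
    """Real quadratic form v^H R v."""
    return float(np.real(v.conj() @ R @ v))

# Case-1 style precoders: dominant / weakest eigenvectors (eigh sorts ascending).
UA, UB = np.linalg.eigh(RA)[1], np.linalg.eigh(RB)[1]
w = [UB[:, -1], UB[:, 0], UB[:, 0]]        # w1, w2, w3
q = [UA[:, -1], UA[:, 0], UA[:, 0]]        # q1, q2, q3
LA = [qf(v, RA) for v in q]                # Lambda_A1..A3
LB = [qf(v, RB) for v in w]                # Lambda_B1..B3
GA = [qf(v, RA) for v in w]                # Gamma_A1..A3
GB = [qf(v, RB) for v in q]                # Gamma_B1..B3

def omega(U):
    """Tr(A)Tr(B) - Tr(AB) with A = U^H RA U and B = U^H RB U."""
    A, B = U.conj().T @ RA @ U, U.conj().T @ RB @ U
    return float(np.real(np.trace(A) * np.trace(B) - np.trace(A @ B)))

OA, OB = omega(np.stack(w[:2], axis=1)), omega(np.stack(q[:2], axis=1))

def sum_rate(P):
    """Approximate ergodic sum-rate R_A + R_B of Proposition 3."""
    P1, P2, P3, P4, P5, P6, P7, P8, P9, P10 = np.maximum(P, 0.0)
    dA1 = 1 / (1 + LA[0] * P3 + LA[1] * P4) + P5 / (1 + P5 + GA[2] * P6 + LA[2] * P7)
    dA2 = P8 / (1 + GA[2] * P9 + LA[2] * P10)
    dB1 = 1 / (1 + LB[0] * P1 + LB[1] * P2) + P8 / (1 + P8 + LB[2] * P9 + GB[2] * P10)
    dB2 = P5 / (1 + LB[2] * P6 + GB[2] * P7)
    RsA = np.log2(1 + dA1 * (GA[0] * P1 + GA[1] * P2) + dA2 * (LB[0] * P1 + LB[1] * P2)
                  + dA1 * dA2 * OA * P1 * P2)
    RsB = np.log2(1 + dB1 * (GB[0] * P3 + GB[1] * P4) + dB2 * (LA[0] * P3 + LA[1] * P4)
                  + dB1 * dB2 * OB * P3 * P4)
    RsAe = np.log2(1 + GA[2] * P6 / (1 + P5 + LA[2] * P7)) + np.log2(1 + GA[2] * P9 / (1 + LA[2] * P10))
    RsBe = np.log2(1 + GB[2] * P7 / (1 + LB[2] * P6)) + np.log2(1 + GB[2] * P10 / (1 + P8 + LB[2] * P9))
    return (RsA + RsAe + RsB + RsBe) / 3

def total_power(P):
    """ASSUMED total transmit power over the three slots (not taken from the paper)."""
    P1, P2, P3, P4, P5, P6, P7, P8, P9, P10 = P
    return (P1 + P2 + P3 + P4 + P5 * (LA[0] * P3 + LA[1] * P4) + P6 + P7
            + P8 * (LB[0] * P1 + LB[1] * P2) + P9 + P10)

# Feasible equal-power starting point: solve 8c + S c^2 = 3P for c.
S = LA[0] + LA[1] + LB[0] + LB[1]
c0 = (-8 + np.sqrt(64 + 12 * S * P_budget)) / (2 * S)
x0 = np.full(10, c0)

res = minimize(lambda P: -sum_rate(P), x0, method="SLSQP",
               bounds=[(0.0, None)] * 10,
               constraints=[{"type": "eq", "fun": lambda P: total_power(P) - 3 * P_budget}],
               options={"maxiter": 500, "ftol": 1e-10})
print(sum_rate(x0), sum_rate(res.x))
```

Because the problem is non-convex, the returned point is at best a local optimum and depends on the starting point; running the solver from several feasible initializations is a common safeguard.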


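In order to gain insight into the optimal power allocation, a necessary condition for optimality of the constrained problem (32) is identified from the Karush-Kuhn-Tucker (KKT) conditions. The optimum power allocation, which depends on the precoders and the channel covariance matrices, is stated as follows.

Theorem 2: At high SNR, the optimal power allocation that maximizes $R_{sum}$ in (32) satisfies

$$\frac{P_1}{P_2} = \frac{1 + \Lambda_{B2} P_8}{1 + \Lambda_{B1} P_8}, \qquad \frac{P_3}{P_4} = \frac{1 + \Lambda_{A2} P_5}{1 + \Lambda_{A1} P_5}, \qquad (33)$$

where $\Lambda_{A1}$, $\Lambda_{A2}$, $\Lambda_{B1}$, $\Lambda_{B2}$ are defined in (30).

Proof: At high SNR, problem (32) can be rewritten as

$$\max \ R_{sum} \approx \frac{1}{3}\Big[\log_2\left(\delta_{A1} \delta_{A2} \Omega_A P_1 P_2\right) + \log_2\left(\delta_{B1} \delta_{B2} \Omega_B P_3 P_4\right) + \log_2\Big(1 + \frac{\Gamma_{A3} P_6}{1 + P_5 + \Lambda_{A3} P_7}\Big) + \log_2\Big(1 + \frac{\Gamma_{A3} P_9}{1 + \Lambda_{A3} P_{10}}\Big)$$
$$\qquad + \log_2\Big(1 + \frac{\Gamma_{B3} P_7}{1 + \Lambda_{B3} P_6}\Big) + \log_2\Big(1 + \frac{\Gamma_{B3} P_{10}}{1 + P_8 + \Lambda_{B3} P_9}\Big)\Big]$$
$$\text{s.t.} \quad P_c - 3P = 0, \quad P_i \geq 0, \ i = 1, \ldots, 10, \qquad (34)$$

where the approximation comes from the fact that the last terms in (25) and (27) are dominant at high SNR. Based on the KKT necessary conditions, there exist multipliers $\lambda$ and $\mu_1, \mu_2$ such that

$$\begin{cases} \nabla R_{sum}(P_1) = \lambda (1 + \Lambda_{B1} P_8) + \mu_1 \\ \nabla R_{sum}(P_2) = \lambda (1 + \Lambda_{B2} P_8) + \mu_2 \\ \mu_1 P_1 = 0, \ \mu_2 P_2 = 0, \end{cases} \qquad (35)$$

where $P_1 \neq 0$ and $P_2 \neq 0$, since otherwise a DoF loss occurs due to $R_{s_A} = 0$. Therefore, we have $\mu_1 = \mu_2 = 0$. Since the term $\log_2(P_1 P_2)$ dominates the dependence of (34) on $P_1$ and $P_2$ at high SNR, $\nabla R_{sum}(P_i)$ is approximately proportional to $1/P_i$ for $i = 1, 2$, and taking the ratio of the first two equations in (35) yields the first equation in (33). Likewise, the second equation can be obtained. ■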

Remark 3: As can be seen from (33), the power allocation depends on the spatial correlation, the precoder design and the SNR. Take case 1 as an example: $\Lambda_{B2} = \mathbf{w}_2^H \mathbf{R}_B \mathbf{w}_2 = \lambda_{\min}(\mathbf{R}_B)$ while $\Lambda_{B1} = \lambda_{\max}(\mathbf{R}_B)$. Then, $\frac{1 + \Lambda_{B2} P_8}{1 + \Lambda_{B1} P_8} \leq 1$ implies that $P_1 \leq P_2$ and likewise $P_3 \leq P_4$. This means that more power needs to be allocated to the weaker eigen-mode ($\mathbf{w}_2$) to constrain the interference imposed on the other desired symbol of the same user. As mentioned before, the transmitted power in the 2nd and 3rd time slots is a function of the beamformers. Such a power allocation method makes it possible to compress the interference and makes room for delivering two more private symbols. Moreover, consider i.i.d. Rayleigh fading channels where $\mathbf{R}_A = \mathbf{R}_B = \mathbf{I}$. Then $\Lambda_{B2} = \mathbf{w}_2^H \mathbf{R}_B \mathbf{w}_2 = 1$ and likewise $\Lambda_{B1} = \Lambda_{A1} = \Lambda_{A2} = 1$. Based on (33) and symmetry, the optimal power allocation satisfies $P_1 = P_2 = P_3 = P_4$ and therefore SAMAT boils down to AMAT. This makes sense because no correlation information is available to suppress the interference. In this case, equal power allocation is the optimal choice.
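A small numerical illustration of the high-SNR ratio condition in Theorem 2 may help. The sketch below evaluates the right-hand side of the condition for a 2 x 2 correlated example and for the i.i.d. case; the value of `P8` and the helper name `power_ratio` are hypothetical choices made only for this illustration.

```python
import numpy as np

def power_ratio(lam_2, lam_1, p_scale):
    """Right-hand side of the ratio condition: (1 + lam_2 * p) / (1 + lam_1 * p)."""
    return (1.0 + lam_2 * p_scale) / (1.0 + lam_1 * p_scale)

# Case-1 style example for M = 2 with |t_B| = 0.9: the eigenvalues of R_B are 1 -/+ |t_B|.
t_B = 0.9
R_B = np.array([[1.0, t_B], [t_B, 1.0]])
lam_min, lam_max = np.linalg.eigvalsh(R_B)   # ascending order

P8 = 5.0                                     # hypothetical scaling power
ratio_corr = power_ratio(lam_min, lam_max, P8)   # P1 / P2 in the correlated case
ratio_iid = power_ratio(1.0, 1.0, P8)            # P1 / P2 when R_B = I
print(ratio_corr, ratio_iid)
```

In the correlated example the ratio is well below one, so most of the stage-I power goes to the symbol sent along the weaker eigen-mode, while the i.i.d. case gives a ratio of exactly one, i.e., equal power.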

To operate the proposed SAMAT transmission protocol, the signaling and feedback procedure is described as follows. Using the LTE-A framework [27], channel state information reference signals (CSI-RS) are transmitted to enable the receiver to measure the short-term CSI and the long-term CSI (channel covariance matrix), which are then fed back to the transmitter via a delayed but assumed perfect feedback link. The long-term CSI only varies at a very slow pace and is therefore not affected by the delay. However, by the time the transmitter has acquired the short-term CSI, the channel has changed and the transmitter only has knowledge of a completely stale short-term CSIT. Based on the long-term and the short-term CSIT, the transmitter computes the precoders and the power allocation and constructs the transmitted signals, which are then transmitted using demodulation reference signals (DM-RS) El. As far as the implementation complexity is concerned, the potential challenge of SAMAT lies in the numerical power allocation computation. Eq. (33) provides a necessary condition which reveals the basic relationship between the power allocation and the spatial correlation matrices as well as the precoders. It would be helpful to identify a sufficient condition and, further, a closed-form power allocation.

Fig. 3: Rate performance comparison of SBF with WE and GE precoders.

VI. PERFORMANCE EVALUATION

We provide numerical results to show the efficacy of the proposed precoder design and power allocation strategy. A single parameter exponential correlation model lE^ is considered as

$$\mathbf{R}_k = \begin{bmatrix} 1 & t_k & \cdots & t_k^{M-1} \\ t_k^* & 1 & \cdots & t_k^{M-2} \\ \vdots & \vdots & \ddots & \vdots \\ (t_k^*)^{M-1} & (t_k^*)^{M-2} & \cdots & 1 \end{bmatrix}, \qquad (36)$$

where $t_k$ denotes the transmit correlation coefficient, $t_k = |t_k| e^{j\phi_k}$, $\phi_k \in [0, 2\pi]$, $k = A, B$. Throughout the paper, we use high (low) correlation to indicate a large (small) condition number of the spatial correlation matrix, which corresponds to a large (small) $|t_k|$ in the exponential model. A large family of spatial correlations is tested to verify our analysis. With the help of the optimization tools in Matlab, 'fmincon' is used to implement the SQP algorithm.
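The exponential correlation model can be generated in a few lines. The following sketch is an illustration only: the helper name `exp_corr` and the numerical values of the coefficient are assumptions. It builds the matrix for a complex coefficient and shows that a larger amplitude leads to a larger condition number, which is the sense in which the text speaks of high and low correlation.

```python
import numpy as np

def exp_corr(t, M):
    """M x M exponential correlation matrix: entry (i, j) is t^(j-i) for j >= i
    and conj(t)^(i-j) otherwise, so the matrix is Hermitian with unit diagonal."""
    d = np.arange(M)[None, :] - np.arange(M)[:, None]   # column index minus row index
    return np.abs(t) ** np.abs(d) * np.exp(1j * np.angle(t) * d)

M = 4
t_low = 0.2 * np.exp(1j * 0.7)     # hypothetical low-correlation coefficient
t_high = 0.95 * np.exp(1j * 0.7)   # hypothetical high-correlation coefficient
R_low, R_high = exp_corr(t_low, M), exp_corr(t_high, M)

cond_low = np.linalg.cond(R_low)
cond_high = np.linalg.cond(R_high)
print(cond_low, cond_high)
```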

A. Precoders Comparison for Statistical Beamforming

In Fig. 3, we plot the ergodic sum-rate of SBF with WE/GE precoders, averaged over the randomness in the channels and $\phi_k$. The amplitudes of the channel correlation coefficients of both users are given by $|t_A| = 0.95$, $|t_B| = 0.9$, and the superiority of the GE over the WE precoder is illustrated by two cases (M = 2, 4). In Fig. 3, the GE beamforming vector shows robustness for large M as well as for a varying scattering environment (i.e., $\phi_k$). Interestingly, WE precoded SBF performs even worse for larger M, which is inherently caused by the idea of zero forcing. The precoder is designed to reduce the interference imposed on the unintended user, but may cancel out the desired signal of the intended user. In other words, as M increases, the M × 1 WE precoder $\mathbf{w} = \mathbf{u}_{\min}(\mathbf{R}_B)$ may fall into the (M - 1)-dimensional null space $\mathcal{N}(\mathbf{h})$ with higher probability.


Fig. 4: Rate performance comparison of AMAT with different precoders. (a) M = 4. (b) M = 2.


B. Precoders Comparison for Alternative MAT

In Fig. 4(a), we compare the ergodic sum-rate performance of AMAT with different precoding methods. $t_k$ is randomly generated: $|t_k| \sim U(0, 1)$, $\phi_k \sim U(0, 2\pi)$. ORG denotes the original AMAT, in which the transmitter sends symbols simply using 2 out of M antennas. WE and GE are the statistical precoders defined in Section III. The optimal precoders (OPT) are computed by the proposed iterative algorithm in Table I. We can observe that OPT achieves a better ergodic sum-rate than the others.

Meanwhile, Fig. 4(b) confirms the validity of Proposition 2. It can be seen that any orthogonal beamforming vectors constituting a unitary matrix are optimal for the M = 2 case. More specifically, Rnd indicates that $\mathbf{W}$ and $\mathbf{Q}$ are randomly generated unitary matrices. WE precoders, corresponding to $\mathbf{u}_{\max}(\mathbf{R}_k)$ and $\mathbf{u}_{\min}(\mathbf{R}_k)$, $k = A, B$, also form unitary matrices. ORG becomes a 2 × 2 identity matrix. All these precoders show optimality in terms of the ergodic sum-rate whereas GE does not, because GE precoders fail to form a unitary matrix (since neither $\mathbf{R}_B^{-1} \mathbf{R}_A$ nor $\mathbf{R}_A^{-1} \mathbf{R}_B$ is a normal matrix).

C. Performance of the Proposed Statistical AMAT (SAMAT)


Fig. 5(a) depicts the achievable ergodic sum-rate performance of various schemes with two transmit antennas (M = 2): original AMAT, SBF with WE precoders and the proposed SAMAT (case 1). We set $|t_A| = |t_B| = |t|$, which varies between 0 and 1, i.e., from uncorrelated to highly correlated channels. Furthermore, $\phi_A$, $\phi_B$ are randomly generated with $|\phi_A - \phi_B| > \pi/2$ and SNR = 20 dB. As $|t|$ increases, the sum-rate of SBF gradually goes up, with a sharp rise at very high correlation levels. This is because, in highly correlated spatial channels, linear beamforming based on statistical information keeps the remaining interference small enough. A special case is the fully correlated channel: when $|t| = 1$, the overheard interference can be completely cancelled.

Moreover, the rate performance of the original AMAT also depends on the transmit correlation of the channel. In the M = 2 case, the ORG precoders for the original AMAT become $\mathbf{W} = \mathbf{Q} = \mathbf{I}_{2 \times 2}$. Observe in (13) and (14) that the ergodic rate is a function of $\mathbf{R}_A$ and $\mathbf{R}_B$. More specifically, $\Theta_A = \Theta_B = \mathrm{Tr}(\mathbf{R}_A)\mathrm{Tr}(\mathbf{R}_B) - \mathrm{Tr}(\mathbf{R}_A \mathbf{R}_B)$. With the correlation model in (36) and the specific phases $\phi_A$, $\phi_B$, a positive/negative impact of the transmit correlation amplitude $|t|$ can be easily computed: $\Theta_A = \Theta_B = 2\left(1 - |t|^2 \cos(|\phi_A - \phi_B|)\right)$. As $|t|$ increases, the transmit correlation is beneficial when $|\phi_A - \phi_B| > \pi/2$ while it is detrimental when $|\phi_A - \phi_B| < \pi/2$. When $|\phi_A - \phi_B| = \pi/2$, the ergodic rate remains constant irrespective of $|t|$.
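The closed form for the M = 2 trace expression is easy to confirm numerically. The sketch below is only a check under the stated 2 × 2 exponential model with equal amplitudes; the function names `theta` and `theta_closed` and the test values are illustrative assumptions.

```python
import numpy as np

def theta(t_abs, phi_A, phi_B):
    """Tr(R_A) Tr(R_B) - Tr(R_A R_B) for 2 x 2 exponential correlation matrices."""
    def corr(phi):
        t = t_abs * np.exp(1j * phi)
        return np.array([[1.0, t], [np.conj(t), 1.0]])
    RA, RB = corr(phi_A), corr(phi_B)
    return float(np.real(np.trace(RA) * np.trace(RB) - np.trace(RA @ RB)))

def theta_closed(t_abs, dphi):
    """Closed form 2 * (1 - |t|^2 * cos(dphi))."""
    return 2.0 * (1.0 - t_abs ** 2 * np.cos(dphi))

# Phase differences above, at, and below pi/2.
for dphi in (3.0, np.pi / 2, 0.5):
    print(dphi, theta(0.1, 0.0, dphi), theta(0.9, 0.0, dphi))
```

For a phase difference above pi/2 the quantity grows with the amplitude, below pi/2 it shrinks, and at exactly pi/2 it stays at 2, matching the three regimes described in the text.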

The cross point between SWEBF and AMAT is determined by the spatial correlation level and the SNR. Fig. 5(a) reveals that the proposed SAMAT scheme obtains a strictly higher rate than SWEBF and AMAT by exploiting both statistical and delayed CSIT. Since the channel statistics include partial current CSI, we can 'virtually' regard them as an imperfect current CSI. Fig. 5(a) coincides with Fig. 1 in [12] in the sense that the proposed schemes softly bridge between SWEBF (PZFBF) and AMAT (MAT) in terms of the ergodic sum-rate (DoF).

In addition, for given channel covariance matrices, the ergodic sum-rate of these schemes can be plotted versus SNR. It can be observed from Fig. 5(b) that SAMAT achieves a higher rate than SWEBF as well as AMAT over the entire SNR region. It acts as SWEBF at low SNR while it utilizes the DoF capability of AMAT in the high SNR regime. As a comparison, PZFBF with perfect instantaneous CSIT reaches a sum DoF of 2 at high SNR. A variety of spatial correlations was simulated but is omitted here for conciseness. Briefly, when $|t_A|, |t_B| \to 0$, the SAMAT transmission protocol boils down to AMAT since no correlated channel information can be exploited to enhance the rate performance. Consider the other extreme, $|t_A|, |t_B| \to 1$ but $|\phi_A - \phi_B| \to 0$, which indicates highly correlated channels whose weakest eigen-modes lie in a similar direction. The rate performance of SWEBF is unfavorable and therefore SAMAT also behaves as AMAT.

Fig. 6 illustrates the benefits of the proposed scheme with the power allocation optimization. M = 4 transmit antennas and robust GE precoders are considered. $\phi_A$ and $\phi_B$ are randomly generated. Specifically, AMAT indicates the original AMAT with equal power allocation, only exploiting delayed CSIT, while AMAT_OPT denotes AMAT precoded by the optimal linear beamforming vectors developed in Section IV. SGEBF denotes the SBF scheme with GE precoders. We plot SAMAT (case 2) with the SQP algorithm. Moreover, we compare the proposed SAMAT scheme with VMAT [13]. As mentioned before, the power constraint of VMAT in stage II was released. To make a fair comparison, we also apply the long-term power constraint to VMAT and scale it down to 3P.

Fig. 6: Comparison of the ergodic sum-rate vs. SNR between SAMAT and baselines, $|t_A| = 0.95$, $|t_B| = 0.9$, $\phi_A, \phi_B \sim U(0, 2\pi)$, M = 4.

In Fig. 6, we observe that AMAT_OPT enables around 5 dB enhancement over the original AMAT at high SNR. VMAT achieves almost the same ergodic sum-rate as AMAT_OPT, since both schemes exploit statistical CSIT under equal power allocation. However, the SBF scheme still outperforms all of them in a certain range of low to intermediate SNR. The proposed SAMAT framework, which is precoded by GE with closed-form power allocation, outperforms all these schemes. Meanwhile, with the optimized power allocation computed by the SQP algorithm, the SAMAT scheme maximizes the ergodic sum-rate (a further 2 dB over AMAT_OPT). The enhancement over VMAT/AMAT_OPT mainly comes from the optimized power allocation.

To sum up, SAMAT boils down to AMAT in low-correlated/uncorrelated channels, while in highly correlated scenarios where SBF outperforms AMAT, it behaves as SBF in the low to intermediate SNR regime and as AMAT at high SNR. In addition, the optimized power values satisfy Theorem 2.


VII. CONCLUSION 

This paper aimed to exploit both statistical and outdated CSIT in a MISO broadcast setting to maximize the ergodic sum-rate. We considered the robust design of statistical beamforming vectors for an arbitrary number of transmit antennas, showing the optimality of dominant generalized eigenvectors in maximizing a lower bound of the ergodic sum-rate. Moreover, the optimal precoders were designed to maximize the rate approximation of AMAT under equal power allocation. An iterative algorithm was explored to compute these precoders with fast convergence.

Fig. 5: Comparison of the ergodic sum-rate vs. $|t_k|$ or SNR for various schemes. (a) $|\phi_A - \phi_B| > \pi/2$, SNR = 20 dB, M = 2. (b) $|\phi_A - \phi_B| > \pi/2$, $|t_A| = 0.95$, $|t_B| = 0.9$, M = 2.


The SBF and AMAT schemes show different performance behaviour depending on the spatial correlation and SNR. To overcome this, the SAMAT transmission protocol was proposed to bridge between SBF and AMAT for a wide range of SNR and an arbitrary number of transmit antennas. In low correlated channels, the SAMAT scheme boils down to AMAT because only limited spatial correlation can be exploited to enhance the ergodic sum-rate. In highly correlated scenarios, it employs the advantage of SBF in the low to intermediate SNR region and the DoF capability of AMAT at high SNR. To sum up, the proposed SAMAT scheme yields a significant ergodic sum-rate enhancement over both SBF and AMAT. At low SNR, the gain mostly comes from extra symbol transmission. At high SNR, it is achieved by power allocation optimization and statistical precoding.

Numerical results were provided to confirm the design and the analysis of this paper. Simulation outputs illustrated that the proposed SAMAT scheme with optimized power allocation achieves a significant ergodic sum-rate enhancement over both SBF and the original AMAT. Finally, a joint optimization of precoder design and power allocation is ongoing work.


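Appendix A
USEFUL LEMMAS

Lemma 1: Consider a non-zero vector $\mathbf{w} \in \mathbb{C}^M$ and $\mathbf{h} = \mathbf{R}^{1/2} \mathbf{h}_w$, where $\mathbf{R}$ is an M × M positive definite Hermitian matrix. Then,

$$E\left[\ln\left(|\mathbf{h}^H \mathbf{w}|^2\right)\right] = \ln\left(\mathbf{w}^H \mathbf{R} \mathbf{w}\right) - \gamma, \qquad (37)$$

where $\gamma$ is the Euler constant.

Proof: Define $\mathbf{X} = \mathbf{R}^{1/2} \mathbf{w} \mathbf{w}^H \mathbf{R}^{1/2}$ and decompose it as $\mathbf{X} = \mathbf{U}_X \boldsymbol{\Lambda}_X \mathbf{U}_X^H$. Since $\mathrm{rank}(\mathbf{X}) = 1$, the diagonal matrix $\boldsymbol{\Lambda}_X$ has only one non-zero entry. Let it be the m-th entry, denoted by $\lambda_X$:

$$\lambda_X = \mathrm{Tr}(\boldsymbol{\Lambda}_X) \overset{(a)}{=} \mathrm{Tr}(\mathbf{X}) \overset{(b)}{=} \mathbf{w}^H \mathbf{R} \mathbf{w}. \qquad (38)$$

Equalities (a) and (b) can be easily obtained by applying $\mathrm{Tr}(\mathbf{A}\mathbf{B}) = \mathrm{Tr}(\mathbf{B}\mathbf{A})$. Then

$$E\left[\ln\left(|\mathbf{h}^H \mathbf{w}|^2\right)\right] = E\left[\ln\left(\mathbf{h}_w^H \mathbf{X} \mathbf{h}_w\right)\right] \overset{d}{=} E\left[\ln\left(\mathbf{h}_w^H \boldsymbol{\Lambda}_X \mathbf{h}_w\right)\right] \qquad (39)$$
$$= E\left[\ln\left(\lambda_X |h_{w,m}|^2\right)\right], \qquad (40)$$

where $\overset{d}{=}$ indicates equivalence in distribution and (40) is calculated with the non-zero element in $\boldsymbol{\Lambda}_X$. Then, (37) can be obtained via (38) and the fact that $|h_{w,m}|^2 \sim \mathrm{Exp}(1)$. ■

Lemma 2: Suppose $x, y$ are two random variables with $E(y) \neq 0$. Let $f(x, y) = \frac{x}{y}$ and $\boldsymbol{\mu} = (E(x), E(y)) = (\mu_x, \mu_y)$. The first order approximation of the expectation of $f(x, y)$ can be written as:

$$E\left(\frac{x}{y}\right) \approx \frac{\mu_x}{\mu_y}. \qquad (41)$$

Proof Sketch: The closed form of $E\left(\frac{x}{y}\right)$ is unknown; however, it can be calculated via a bivariate Taylor expansion at $\boldsymbol{\mu}$:

$$E(f(x, y)) = \sum_{n=0}^{\infty} (-1)^n \left(\frac{\mu_x c_{0,n}}{\mu_y^{n+1}} + \frac{c_{1,n}}{\mu_y^{n+1}}\right), \qquad (42)$$

where $c_{i,j} = E\left[(x - \mu_x)^i (y - \mu_y)^j\right]$. Take the first order approximation of (42) and (41) is obtained. Similar results were derived in an alternative manner in S. However, it is difficult to calculate the high-order terms in (42), so that the first and second order approximations were used in ifiHl. It is assumed here that $E\left(\frac{x}{y}\right)$ is bounded and its Taylor expansion converges. Moreover, if $x, y$ are mutually independent nonnegative random variables, the first order approximation is a lower bound, i.e., $E\left(\frac{x}{y}\right) \geq \frac{\mu_x}{\mu_y}$.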
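Both lemmas can be sanity-checked by Monte Carlo simulation. The sketch below is an illustration under assumptions: the correlation coefficient, the vector `w`, the sample size and the distributions chosen for the ratio example (x exponential, y equal to one plus an independent exponential) are all hypothetical. For that particular ratio the exact mean is e·E1(1), which lets the first-order value be compared against the true one.

```python
import numpy as np
from scipy.special import exp1

rng = np.random.default_rng(1)
N, M = 400_000, 4

# --- Lemma 1: E[ln |h^H w|^2] = ln(w^H R w) - gamma ---
t = 0.8 * np.exp(1j * 0.3)                       # hypothetical correlation coefficient
d = np.arange(M)[None, :] - np.arange(M)[:, None]
R = np.abs(t) ** np.abs(d) * np.exp(1j * np.angle(t) * d)
lam, U = np.linalg.eigh(R)
R_half = (U * np.sqrt(lam)) @ U.conj().T         # Hermitian square root of R
w = rng.standard_normal(M) + 1j * rng.standard_normal(M)
h_w = (rng.standard_normal((N, M)) + 1j * rng.standard_normal((N, M))) / np.sqrt(2)
h = h_w @ R_half.T                               # each row is (R^{1/2} h_w)^T
lemma1_mc = np.mean(np.log(np.abs(h.conj() @ w) ** 2))
lemma1_cf = np.log(np.real(w.conj() @ R @ w)) - np.euler_gamma

# --- Lemma 2: first-order value mu_x / mu_y versus the true mean of x / y ---
x = rng.exponential(1.0, N)
y = 1.0 + rng.exponential(1.0, N)                # independent of x, nonnegative
ratio_mc = np.mean(x / y)
first_order = 1.0 / 2.0                          # mu_x / mu_y
ratio_exact = np.e * exp1(1.0)                   # E[x] * E[1 / (1 + z)], z ~ Exp(1)

print(lemma1_mc, lemma1_cf)
print(ratio_mc, first_order, ratio_exact)
```

In this example the first-order value underestimates the true mean, in line with the lower-bound statement for independent nonnegative variables.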
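Appendix B

PROOF OF THEOREM 1

The proof relies on deriving a lower bound on the achievable ergodic sum-rate. Starting from the SBF rate expression, we can rewrite the ergodic sum-rate as

$$R_{sum} = E\left[\log_2\left(1 + \frac{\rho |\mathbf{h}^H \mathbf{w}|^2}{1 + \rho |\mathbf{h}^H \mathbf{q}|^2}\right)\right] + E\left[\log_2\left(1 + \frac{\rho |\mathbf{g}^H \mathbf{q}|^2}{1 + \rho |\mathbf{g}^H \mathbf{w}|^2}\right)\right]$$
$$= E\left[\log_2\left(1 + \exp\left(\ln \frac{\rho |\mathbf{h}^H \mathbf{w}|^2}{1 + \rho |\mathbf{h}^H \mathbf{q}|^2}\right)\right)\right] + E\left[\log_2\left(1 + \exp\left(\ln \frac{\rho |\mathbf{g}^H \mathbf{q}|^2}{1 + \rho |\mathbf{g}^H \mathbf{w}|^2}\right)\right)\right]$$
$$\overset{(a)}{\geq} \log_2\left[1 + \exp\left(E\left(\ln\left(\rho |\mathbf{h}^H \mathbf{w}|^2\right)\right) - E\left(\ln\left(1 + \rho |\mathbf{h}^H \mathbf{q}|^2\right)\right)\right)\right] + \log_2\left[1 + \exp\left(E\left(\ln\left(\rho |\mathbf{g}^H \mathbf{q}|^2\right)\right) - E\left(\ln\left(1 + \rho |\mathbf{g}^H \mathbf{w}|^2\right)\right)\right)\right]$$
$$\overset{(b)}{\approx} \log_2\left(1 + \frac{\rho\, \mathbf{w}^H \mathbf{R}_A \mathbf{w}}{\rho\, \mathbf{q}^H \mathbf{R}_A \mathbf{q}}\right) + \log_2\left(1 + \frac{\rho\, \mathbf{q}^H \mathbf{R}_B \mathbf{q}}{\rho\, \mathbf{w}^H \mathbf{R}_B \mathbf{w}}\right)$$
$$\overset{(c)}{\geq} \log_2\left(\frac{\mathbf{w}^H \mathbf{R}_A \mathbf{w}}{\mathbf{w}^H \mathbf{R}_B \mathbf{w}} \cdot \frac{\mathbf{q}^H \mathbf{R}_B \mathbf{q}}{\mathbf{q}^H \mathbf{R}_A \mathbf{q}}\right).$$

Since $\log_2(1 + r e^x)$ is convex in $x$ for $r > 0$, we can obtain (a) with Jensen's inequality. At high SNR, (b) can be asymptotically approximated by first dropping '1 +' in the parentheses and then applying Lemma 1. The tightness of (b) has been shown in the asymptotic regime ($M \to \infty$) 1^. Moreover, the lower bound in (c) is tight in highly correlated systems with proper beamforming vectors. Interestingly, a recent work S independently proved that $R_{sum}$ can be well approximated by (b) in massive MIMO systems.

With (c) at hand, we can transform the optimization problem into

$$\max_{\|\mathbf{w}\| = 1, \|\mathbf{q}\| = 1} R_{sum,lb} = \log_2\left(\frac{\mathbf{w}^H \mathbf{R}_A \mathbf{w}\ \mathbf{q}^H \mathbf{R}_B \mathbf{q}}{\mathbf{w}^H \mathbf{R}_B \mathbf{w}\ \mathbf{q}^H \mathbf{R}_A \mathbf{q}}\right), \qquad (43)$$

for which the generalized eigenvector structure is the optimal solution llssll, as stated in Theorem 1. $\mathbf{w}$ corresponds to the dominant eigenvector of $\mathbf{R}_B^{-1} \mathbf{R}_A$ while $\mathbf{q}$ corresponds to the weakest one. The corresponding ergodic sum-rate satisfies $R_{sum,lb} = \log_2\left(\chi(\mathbf{R}_B^{-1} \mathbf{R}_A)\right)$, where $\chi(\cdot)$ is the ratio of the largest to the smallest eigenvalue. Both $\mathbf{R}_B^{-1} \mathbf{R}_A$ and $\mathbf{R}_A^{-1} \mathbf{R}_B$ are positive definite, since $\mathbf{R}_A$ and $\mathbf{R}_B$ are positive definite and $(\mathbf{R}_B^{-1} \mathbf{R}_A)^{-1} = \mathbf{R}_A^{-1} \mathbf{R}_B$. It is easy to find that $\chi(\mathbf{R}_A^{-1} \mathbf{R}_B) = \chi(\mathbf{R}_B^{-1} \mathbf{R}_A)$ and thereby we can obtain Theorem 1.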
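The generalized eigenvector solution of the lower-bound maximization can be computed with a standard generalized Hermitian eigen-solver. The sketch below is an illustration under assumptions: the correlation coefficients are hypothetical and SciPy's `eigh(RA, RB)` is used to solve $\mathbf{R}_A \mathbf{v} = \lambda \mathbf{R}_B \mathbf{v}$, whose eigenvectors coincide with those of $\mathbf{R}_B^{-1} \mathbf{R}_A$. It checks that the resulting objective equals the ratio of extreme generalized eigenvalues and that random unit vectors do not exceed it.

```python
import numpy as np
from scipy.linalg import eigh, toeplitz

M = 4
tA, tB = 0.95 * np.exp(1j * 0.4), 0.9 * np.exp(1j * 2.5)   # hypothetical coefficients
k = np.arange(M)
RA = toeplitz(np.conj(tA) ** k, tA ** k)
RB = toeplitz(np.conj(tB) ** k, tB ** k)

# Generalized problem RA v = lam RB v; eigenvalues are returned in ascending order.
lam, V = eigh(RA, RB)
w = V[:, -1] / np.linalg.norm(V[:, -1])   # dominant generalized eigenvector
q = V[:, 0] / np.linalg.norm(V[:, 0])     # weakest generalized eigenvector

def objective(w, q):
    """(w^H RA w / w^H RB w) * (q^H RB q / q^H RA q)."""
    num = np.real(w.conj() @ RA @ w) * np.real(q.conj() @ RB @ q)
    den = np.real(w.conj() @ RB @ w) * np.real(q.conj() @ RA @ q)
    return num / den

best = objective(w, q)
bound_bits = np.log2(best)                # value of the lower bound in bits

# Random unit vectors for comparison.
rng = np.random.default_rng(3)
trials = []
for _ in range(200):
    a = rng.standard_normal(M) + 1j * rng.standard_normal(M)
    b = rng.standard_normal(M) + 1j * rng.standard_normal(M)
    trials.append(objective(a / np.linalg.norm(a), b / np.linalg.norm(b)))
print(best, lam[-1] / lam[0], max(trials))
```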


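Appendix C

PROOF OF PROPOSITION 1

We can lower bound the mutual information $I_A$ of user A by applying the Minkowski determinant theorem [34]:

$$I_A = \log_2 \det\left(\mathbf{I}_{2 \times 2} + \rho \mathbf{M}\right) \qquad (44)$$
$$\geq \log_2\left(1 + \rho \det(\mathbf{M})^{1/2}\right)^2 \qquad (45)$$
$$= 2 \log_2\left(1 + \rho \exp\left(\tfrac{1}{2} \ln \det(\mathbf{M})\right)\right), \qquad (46)$$

where

$$\mathbf{M} = \left[\mathbf{W}^H \mathbf{h}_1, \mathbf{W}^H \mathbf{g}_1\right] \begin{bmatrix} \frac{|h_{21}|^2}{1 + |h_{21}|^2} & 0 \\ 0 & |h_{31}|^2 \end{bmatrix} \begin{bmatrix} \mathbf{h}_1^H \mathbf{W} \\ \mathbf{g}_1^H \mathbf{W} \end{bmatrix} = \mathbf{G}_{2 \times 2} \boldsymbol{\Lambda} \mathbf{G}_{2 \times 2}^H. \qquad (47)\text{-}(49)$$

By applying the convexity of $\log_2(1 + r e^x)$, $r > 0$, and Jensen's inequality, the ergodic rate of user A per slot can be lower bounded as

$$R_A \geq \frac{2}{3} E\left[\log_2\left(1 + \rho \exp\left(\tfrac{1}{2} \ln \det(\mathbf{M})\right)\right)\right] \qquad (50)$$
$$\geq \frac{2}{3} \log_2\left(1 + \rho \exp\left(\tfrac{1}{2} E\left[\ln \det(\mathbf{M})\right]\right)\right), \qquad (51)$$

where $E[\ln \det(\mathbf{M})] = E[\ln \det(\boldsymbol{\Lambda})] + E[\ln \det(\mathbf{G}\mathbf{G}^H)]$. The first term can be further calculated as

$$E[\ln \det(\boldsymbol{\Lambda})] = E\left[\ln\left(\frac{|h_{21}|^2}{1 + |h_{21}|^2}\right)\right] + E\left[\ln\left(|h_{31}|^2\right)\right] \qquad (52)$$
$$= e\, \mathrm{Ei}(-1) - 2\gamma, \qquad (53)$$

where (53) is obtained by using the fact that $|h_{j1}|^2 \sim \mathrm{Exp}(1)$.

In general, it is nontrivial to evaluate the second term. A special case lies in the i.i.d. Rayleigh fading channel, where $E[\ln \det(\mathbf{G}\mathbf{G}^H)]$ can be exactly solved by invoking the central Wishart distribution. For spatially correlated channels, we use Jensen's inequality to upper bound the second term as

$$E\left[\ln \det(\mathbf{G}\mathbf{G}^H)\right] \leq \ln E\left[\det(\mathbf{G}\mathbf{G}^H)\right]$$
$$\overset{(a)}{=} \ln\left[E\left(\mathbf{h}_1^H \mathbf{W}\mathbf{W}^H \mathbf{h}_1\, \mathbf{g}_1^H \mathbf{W}\mathbf{W}^H \mathbf{g}_1 - \mathbf{h}_1^H \mathbf{W}\mathbf{W}^H \mathbf{g}_1\, \mathbf{g}_1^H \mathbf{W}\mathbf{W}^H \mathbf{h}_1\right)\right]$$
$$\overset{(b)}{=} \ln\left[E\left(\mathbf{h}_1^H \mathbf{W}\mathbf{W}^H \mathbf{h}_1\right) E\left(\mathbf{g}_1^H \mathbf{W}\mathbf{W}^H \mathbf{g}_1\right) - E\left(\mathbf{h}_1^H \mathbf{W}\mathbf{W}^H \mathbf{g}_1\, \mathbf{g}_1^H \mathbf{W}\mathbf{W}^H \mathbf{h}_1\right)\right]$$
$$\overset{(c)}{=} \ln(\Theta_A), \qquad (54)$$

where $\Theta_A$ is defined in (14). Eq. (a) is obtained with $\det(\mathbf{A}\mathbf{B}) = \det(\mathbf{A})\det(\mathbf{B})$ for equal-size square matrices $\mathbf{A}, \mathbf{B}$. Eq. (b) holds because $\mathbf{h}_1$ and $\mathbf{g}_1$ are independent Gaussian random vectors. Noting that $\mathrm{Tr}(\mathbf{A}\mathbf{B}) = \mathrm{Tr}(\mathbf{B}\mathbf{A})$ and $E[\mathrm{Tr}(\mathbf{C})] = \mathrm{Tr}[E(\mathbf{C})]$, eq. (c) can be easily calculated.

Finally, substituting (53) and (54) into (51) renders an analytical approximation of the ergodic rate of user A and hence completes the proof.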
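Two ingredients of this proof lend themselves to a quick numerical check: the 2 × 2 Minkowski determinant bound and the closed form e·Ei(-1) - 2γ for the expected log-determinant of the diagonal factor. The sketch below is only a sanity check under assumptions: the SNR value `rho`, the sample sizes and the use of random Gaussian matrices are illustrative choices, and `scipy.special.expi` is used for the exponential integral Ei.

```python
import numpy as np
from scipy.special import expi

rng = np.random.default_rng(2)
rho = 10.0   # hypothetical SNR value

# Minkowski bound for 2 x 2 positive semidefinite M:
# log2 det(I + rho M) >= 2 log2(1 + rho det(M)^{1/2}).
gaps = []
for _ in range(1000):
    G = (rng.standard_normal((2, 2)) + 1j * rng.standard_normal((2, 2))) / np.sqrt(2)
    Mmat = G @ G.conj().T
    det_M = max(float(np.real(np.linalg.det(Mmat))), 0.0)
    lhs = np.log2(np.real(np.linalg.det(np.eye(2) + rho * Mmat)))
    rhs = 2 * np.log2(1 + rho * np.sqrt(det_M))
    gaps.append(lhs - rhs)
min_gap = min(gaps)

# Expected log-determinant of diag(x1 / (1 + x1), x2) with x1, x2 ~ Exp(1).
N = 400_000
x1, x2 = rng.exponential(1.0, N), rng.exponential(1.0, N)
logdet_mc = np.mean(np.log(x1 / (1 + x1)) + np.log(x2))
logdet_cf = np.e * expi(-1.0) - 2 * np.euler_gamma
print(min_gap, logdet_mc, logdet_cf)
```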
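Appendix D

PROOF OF PROPOSITION 3

Define $\mathbf{M} = \mathbf{P}_a^{1/2} \mathbf{H}_1^H \mathbf{K}^{-1} \mathbf{H}_1 \mathbf{P}_a^{1/2}$. With simple manipulations, we have

$$\mathbf{M} = \mathbf{P}_a^{1/2} \left[\mathbf{W}^H \mathbf{h}_1, \mathbf{W}^H \mathbf{g}_1\right] \begin{bmatrix} \frac{1}{k_1} + \frac{P_5 |h_{21}|^2}{k_2} & 0 \\ 0 & \frac{P_8 |h_{31}|^2}{k_3} \end{bmatrix} \begin{bmatrix} \mathbf{h}_1^H \mathbf{W} \\ \mathbf{g}_1^H \mathbf{W} \end{bmatrix} \mathbf{P}_a^{1/2} \qquad (55)$$
$$= \mathbf{P}_a^{1/2}\, \mathbf{G}_{2 \times 2}\, \boldsymbol{\Gamma}\, \mathbf{G}_{2 \times 2}^H\, \mathbf{P}_a^{1/2}. \qquad (56)$$

Rewrite (22) as

$$R_{s_A} = E\left[\log_2 \det\left(\mathbf{I}_{2 \times 2} + \mathbf{M}\right)\right] \qquad (57)$$
$$= E\left[\log_2\left(1 + \mathrm{Tr}(\mathbf{M}) + \det(\mathbf{M})\right)\right] \qquad (58)$$
$$\leq \log_2\left[1 + E\left(\mathrm{Tr}(\mathbf{M})\right) + E\left(\det(\mathbf{M})\right)\right]. \qquad (59)$$

Eq. (57) is obtained with $\det(\mathbf{I} + \mathbf{A}\mathbf{B}) = \det(\mathbf{I} + \mathbf{B}\mathbf{A})$ while (58) makes use of the Cayley-Hamilton theorem. Then, we upper bound (58) by (59) using Jensen's inequality. With the help of (56) and $E[\mathrm{Tr}(\cdot)] = \mathrm{Tr}[E(\cdot)]$, the first term in (59) can be characterized as

$$E\left[\mathrm{Tr}(\mathbf{M})\right] = \mathrm{Tr}\left[\left(E(\Gamma_{11})\, \mathbf{W}^H \mathbf{R}_A \mathbf{W} + E(\Gamma_{22})\, \mathbf{W}^H \mathbf{R}_B \mathbf{W}\right) \mathbf{P}_a\right] \qquad (60)$$
$$\approx \delta_{A1}\left(\Gamma_{A1} P_1 + \Gamma_{A2} P_2\right) + \delta_{A2}\left(\Lambda_{B1} P_1 + \Lambda_{B2} P_2\right), \qquad (61)$$

where $\Lambda_{B1} = \mathbf{w}_1^H \mathbf{R}_B \mathbf{w}_1$, $\Lambda_{B2} = \mathbf{w}_2^H \mathbf{R}_B \mathbf{w}_2$, $\Gamma_{A1} = \mathbf{w}_1^H \mathbf{R}_A \mathbf{w}_1$, $\Gamma_{A2} = \mathbf{w}_2^H \mathbf{R}_A \mathbf{w}_2$ and

$$E(\Gamma_{11}) = E\left(\frac{1}{k_1} + \frac{P_5 |h_{21}|^2}{k_2}\right), \quad E(\Gamma_{22}) = E\left(\frac{P_8 |h_{31}|^2}{k_3}\right). \qquad (62)$$

The terms on the right hand side of (62) can be further evaluated as follows:

$$E\left[\frac{1}{k_1}\right] = E\left[\frac{1}{1 + \|\mathbf{h}_1^H \mathbf{Q} \mathbf{P}_b^{1/2}\|^2}\right] \geq \frac{1}{1 + E\left(\|\mathbf{h}_1^H \mathbf{Q} \mathbf{P}_b^{1/2}\|^2\right)} = \frac{1}{1 + \Lambda_{A1} P_3 + \Lambda_{A2} P_4}, \qquad (63)$$

[...] rendering the problem too complicated to implement optimization techniques. Similarly, we can approximate $E(\Gamma_{22})$ as

$$E\left[\frac{P_8 |h_{31}|^2}{k_3}\right] = E\left[\frac{\mathbf{h}_{w,3}^H \mathbf{C} \mathbf{h}_{w,3}}{1 + \mathbf{h}_{w,3}^H \mathbf{D} \mathbf{h}_{w,3}}\right] \approx \frac{\mathrm{Tr}(\mathbf{C})}{1 + \mathrm{Tr}(\mathbf{D})} = \frac{P_8}{1 + \Gamma_{A3} P_9 + \Lambda_{A3} P_{10}}, \qquad (65)$$

where $\mathbf{D} = \mathbf{R}_A^{1/2}\left(P_9 \mathbf{w}_3 \mathbf{w}_3^H + P_{10} \mathbf{q}_3 \mathbf{q}_3^H\right) \mathbf{R}_A^{1/2}$ and $\mathbf{C} = P_8 \mathbf{R}_A^{1/2} \mathbf{x}_1 \mathbf{x}_1^H \mathbf{R}_A^{1/2}$. The second term in (59) can be given by

$$E\left[\det(\mathbf{M})\right] = E\left[\det(\boldsymbol{\Gamma})\right] \cdot E\left[\det(\mathbf{G}\mathbf{G}^H)\right] \cdot E\left[\det(\mathbf{P}_a)\right] \qquad (66)$$
$$\approx \delta_{A1} \delta_{A2} \Omega_A P_1 P_2, \qquad (67)$$

where the calculation of $E\left[\det(\mathbf{G}\mathbf{G}^H)\right]$ follows (a), (b), (c) of eq. (54). Substituting (63)-(65) into (60) and (66), we can obtain (61) and (67). Combining (61) and (67) with (59) establishes (25).

In order to compute $R_{s_A^e}$, we can reexpress (24) as

$$R_{s_A^e} = E\left[\log_2\left(1 + \frac{P_6 |\mathbf{h}_2^H \mathbf{w}_3|^2}{1 + P_5 |h_{21}|^2 + P_7 |\mathbf{h}_2^H \mathbf{q}_3|^2}\right)\right] + E\left[\log_2\left(1 + \frac{P_9 |\mathbf{h}_3^H \mathbf{w}_3|^2}{1 + P_{10} |\mathbf{h}_3^H \mathbf{q}_3|^2}\right)\right] \qquad (68)$$
$$\leq \log_2\left(1 + E\left[\frac{P_6 |\mathbf{h}_2^H \mathbf{w}_3|^2}{1 + P_5 |h_{21}|^2 + P_7 |\mathbf{h}_2^H \mathbf{q}_3|^2}\right]\right) + \log_2\left(1 + E\left[\frac{P_9 |\mathbf{h}_3^H \mathbf{w}_3|^2}{1 + P_{10} |\mathbf{h}_3^H \mathbf{q}_3|^2}\right]\right) \qquad (69)$$
$$\approx \log_2\left(1 + \frac{\Gamma_{A3} P_6}{1 + P_5 + \Lambda_{A3} P_7}\right) + \log_2\left(1 + \frac{\Gamma_{A3} P_9}{1 + \Lambda_{A3} P_{10}}\right). \qquad (70)$$

An analytical expression of (68) was obtained for the case M = 2 in 1^, while a lower bound for the M > 2 case is derived in Section III. We here use Jensen's inequality and (41) in Lemma 2 to estimate (68), leading to an approximation (69) as well as (25).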